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Abstract 

This dissertation studies the statistics and modchng of a quantum system probed by 
a coherent laser field. We focus on an ensemble of qubits dispersively coupled to a 
traveling wave light field. The first research topic explores the quantum measurement 
statistics of a quasi-monochromatic laser probe. We identify the shortest timescale 
that successive measurements approximately commute. Our model predicts that for 
a probe in the near infrared, noncommuting measurement effects are apparent for 
subpicosecond times. 

The second dissertation topic attempts to find an approximation to a conditional 
master equation, which maps identical product states to identical product states. 
Through a technique known as projection filtering, we find such a equation for an 
ensemble of qubits experiencing a diffusive measurement of a collective angular mo- 
mentum projection, in addition to global rotations. We then test the quality of the 
approximation through numerical simulations. This measurement model is known 
to be entangling and without the rotations we find poor agreement between the ex- 
act and approximate predictions. However, in the presence of strong randomized 
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rotations, the approximation reproduces the exact expectation values to within 95% 
accuracy. 

The final topic applies the projection filter to the problem of state reconstruc- 
tion. We find an initial state estimate based on a single continuous measurement 
of an identically prepared atomic ensemble. Given the ability to make a continuous 
collective measurement and simultaneously applying time varying controls, it is pos- 
sible to find an accurate estimate given based upon a single measurement realization. 
Previous experiments implementing this method found high fidelity estimates, but 
were ultimately limited by decoherence. Here we explore the fundamental limits of 
this protocol by studying an idealized model for pure qubits, which is limited only 
by measurement backaction. This ultimately makes the measurement statistics a 
nonlinear function of the initial state. Via the projection filter, we find an efficiently 
computed approximation to the log-likelihood function. Using the exact dynamics to 
produce simulated measurements, we then numerically search for a maximum like- 
lihood estimate based on the approximate expression. We ultimately find that our 
estimation technique nearly achieves an average fidelity bound set by an optimum 
POVM. 
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Chapter 1 
Introduction 



Within the past three decades, the abihty to engineer individual quantum systems 
into highly nonclassical states has become a reality. The fundamental technology 
that facilitated these revolutionary experiments is the coherent laser with its ability 
to address specific electronic transitions in matter. A quasi-monochromatic laser can 
also introduce optical forces on position degrees of freedom. While initially used to 
laser cool and trap atoms, coherent electronic superpositions can also be transferred 
to external superpositions allowing for atom interferometers [1] or highly nonclassical 
states in trapped ions [2, 3]. 

In addition to providing control of an atomic system at a quantum level, the same 
laser systems can be used to measure the atomic state of the system. The simplest 
of all detection methods is resonant fluorescence, where laser light resonant with a 
single transition is applied to an atom, which will scatter photons if that level is 
occupied. However, if the internal state is in a superposition between the resonant 
level and an additional off-resonant 'dark' state, the presence or absence of scattered 
light provides information about the internal state of the system to the experimenter 
[4]. 

The quantum nature of the atom-light interaction carries over to off resonant 
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applications. In a low intensity regime, a free space laser with a carrier frequency 
significantly detuned from an atomic transition predominantly induces a state de- 
pendent energy shift without significantly exciting that transition. The specific form 
of the interaction also depends upon the polarization of the exciting laser. In a given 
parameter regime the resulting Hamiltonian dominates over the decoherence from 
absorption and subsequent emission, resulting in a controllable coupling between the 
atoms and the polarization of the probe laser [5]. This coupling affects both the 
atomic and polarization quantum states. The fact that the laser is a traveling wave 
means that this is a fundamentally open quantum system and the output state of 
light carries with it some information about the atomic state. 

Quantum mechanics is at its core a probabilistic theory where the wave function 
is a tool for computing the probability of observing experimental events. Upon the 
receipt of a measurement outcome, an accurate description of the quantum system 
must reflect this new information. This is true in an idealized projective measurement 

or in indirect measurements like those described above. However there arc significant 
differences between the method of state detection via resonance fluorescence and off- 
resonance polarization spectroscopy. 

In fluorescence detection a vast majority of the scattered light is ultimately lost, 
either because only a small fraction of possible emission directions are observed or 
due to losses in the detection apparatus. It takes a considerable experimental effort 
to measure as httle as 5% of the total scattered light from a single trapped ion [6, 7] . 
A single ion will have to scatter a lot of light in order for an experimenter to be 
able to discriminate a bright state from a dark state with any reasonable confidence. 
This means that after a relatively short time period it is likely that an ion prepared 
in a superposition of bright and dark states has scattered several photons that were 
lost to the experimenter. Honest scientists would be forced to admit that while they 
were still uncertain as to the outcome of the measurement, they are quite certain 
that any coherence between the bright and dark states has been destroyed. 
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In the off-resonant sclieme, almost all of the probe light can be collected, mean- 
ing that a clever experimentalist has access to nearly all of the information available. 
After a short interaction time the measured state will in general change, but only 
in proportion to the amount of information gained. The point is that armed with a 
complete measurement record it is possible to track the evolution of the state from 
an initial superposition to a final projected outcome. This kind of measurement is 
known as a weak quantum nondemolition measurement (QND), and has been demon- 
strated in several different experiments involving ensembles of monatomic gasses in 
various parameter regimes. One important consequence of this kind of measurement 
is that the projective outcome is a highly nonclassical state involving strong quantum 
coherence between all of the atoms in the ensemble. While the experimental realities 
of photon scattering ultimately limit the system from reaching this eigenstate, an 
intermediate squeezed state has been observed on several occasions [8-10], where 
the uncertainty of the measured observable is reduced below the standard quantum 
limit. 

This dissertation is focused on the modeling of a quantum atomic system inter- 
acting with a quantized traveling-wave optical probe when that field is also measured 
continuously in time. The most interesting quantum effects occur in the idealized 
case with no loss of light and a noise free measurement, which is the only case consid- 
ered here. The progression from an initial superposition state to a final measurement 
eigenstate is neither a time stationary process nor a linear transformation and so a 
model capable of tracking the full transition must be both time-adaptive and non- 
linear. Finding such a mathematical description is not a trivial exercise but one that 
has been extensively studied previously. 

The ultimate objective is to apply this continuous measurement model to the 
problem of quantum state tomography. Constructing an estimate for an arbitrary 
quantum state based upon experimental data is very resource intensive. Specifying 
an arbitrary quantum state for a d-dimensional system requires at most (P — 1 pa- 
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rameters and for each parameter A^^ uncorrelated projective measurements generally 
gives an accuracy of \/N. Through an alternative protocol proposed by Silbcrfarb 
et al., these inefficiencies can be largely side-stepped, by applying a weak continu- 
ous measurement plus a well chosen dynamical control collectively to an ensemble 
of identically prepared systems [11]. If the control drives the system in such a way 
as to make the measurement informationally complete, then a single measurement 
record has encoded information about all d? — 1 parameters, albeit with a varying 
level of certainty and noise corruption. 

In particular, we consider an atomic ensemble prepared in an identical tensor 
product state ptot = Po " ^^^^ experiences a known Hamiltonian while simultaneously 
coupled to a traveling wave probe via a collective degree of freedom. A continuous 
measurement of this probe then generates a measurement record that is strongly cor- 
related with the evolution of the system. With sufficient signal to noise, a statistical 
estimate of an unknown initial system state will in general have high fidelity with the 
true initial condition. Using the weak measurement generated by an off-resonance 
probe, this state reconstruction procedure has been implemented in the laboratory, 
allowing reconstruction of the full hyperfine, d — 16, ground state manifold of a 
laser cooled neutral Cs atom ensemble [12, 13]. However, these experiments were 
performed in a parameter regime where the amount of information lost to the en- 
vironment dominated over any measurement induced backaction. Chap. 5 explores 
how this procedure performs in an opposite regime where dccoherence is negligible 
and we retain a complete measurement record. Arriving at this result requires sev- 
eral intermediate steps, particularly a detailed knowledge of how a classical statistical 
estimate is made and how that is applied to a quantum system. 

The estimation of a possibly random signal from an observation corrupted by 
unwanted noise is known as filtering and takes its origin from the work of Wiener 
[14], where the signal was assumed to be generated with time stationary statistics. In 
a linear system with additive Gaussian noise, an optimum estimate to a continuous 
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nonstationary signal was computed by Kalman and Bucy [15] and is an indispensable 
tool in engineering and classical signal processing and control. Not surprisingly an 
estimate to a nonlinear signal is significantly more challenging than in a linear system. 
The nonlinear classical filter began with Stratonovich [16] and was later expressed 
in the useful language of Ito calculus by Kushner [17]. Important contributions were 
made by KaUianpur and Striebel [18] and Zakai [19], steps that are particularly useful 
in formulating a quantum analog. Nonlinear filtering theory has an extremely wide 
range of applications including GPS based navigation, optimal stochastic control, 
financial portfolio optimization, audio and imaging noise removal and enhancement, 
speech recognition, weather prediction, and so on [20]. For each application there 
is a rich body of literature with a wealth of numerical and approximation methods 
allowing for practical and, in some cases, real-time implementations. 

One of the fundamental tools that makes continuous-time classical estimation 
possible is stochastic calculus. In the same way that a global function can be built 
up from an integral over local infinitesimals, a random signal can also be constructed 
from random increments. In order for the filtering problem to be remotely tractable, 
it is necessary that the random increments originate from an uncorrclated noise 
source. This means that the fundamental noise injected into an otherwise determin- 
istic system is assumed to be uncorrelated white noise. Under this assumption the 
filtering equation is Markovian, meaning that the estimate updates only according to 
the latest measurement and its most recent value. Classically, a white noise approx- 
imation is often well justified when the input noise is actually an aggregate effect 
from a large number of uncorrelated sources. The canonical example is a particle ex- 
periencing Brownian motion. Each impulse is a collision with a background molecule 
imparts a small amount of momentum to the larger particle. For times that are longer 
than the time between collisions, the net displacement is uncorrelated with previous 
intermediate times. In this case not only are the collisions uncorrelated, the parti- 
cle's displacement is Gaussian distributed with a variance that grows proportional 
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to time. Brownian motion is a classic example for a system influenced by Gaussian 
white noise, but due to the central limit theorem, these models are ubiquitous in 
systems with continuous trajectories. 

Working with white noise directly adds another layer of sophistication to an 
already mathematically challenging topic [21]. This is due to, among other problems, 
the fact that as white noise is defined to have completely uncorrelated fiuctuations at 
any point in time, it is therefore discontinuous; continuity from one point to the next 
would imply correlations. To build a mathematical framework that is both useful 
and provably consistent, the hard learned lesson is to frame the problem not in terms 
of the white noise itself but to instead use its integral, which is at least continuous^ 
[22]. The most widely used representation for an integral over Gaussian white noise 
(in other words a mathematical model for Brownian motion) is the Wiener process. 
Chap. 3 reviews many of its defining properties, which we are able to leverage into a 
maximum likelihood estimate of an initial quantum state based upon a polarimetry 
measurement. 

Beyond a successful model for integrated white noise, a classical filter needs to 
be able to manipulate these integrals in a full fiedged calculus. The subtlety of 
dealing with a randomized integral is that different limiting approximations lead to 
fundamentally different stochastic calculi. The two most common forms of stochastic 
integration are the Stratonovich integral and the ltd integral, with differing calculus 
rules and statistical properties [22]. Both forms of integration are used here and a 
brief review is presented in Appendix B. One drawback from initializing a model 
with a stochastic integral is that any predictions will dramatically depend on what 
kind of integral is used. At one level the choice of integral is no different than a 
number of other approximations one makes in formulating a statistical model of a 
given physical system. 

^The calculus of randomized distributions characterizing white noise is referenced back 
to the integrated expressions anyway [21]. 
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The lessons from developing a stochastic calculus for classical systems has also 
been applied to quantum models as well. In the mid 80 's Hudson and Parthasarathy 
developed an operator- valued quantum version of the classical Ito calculus [23] . This 
quantum Ito calculus is indispensable in modeling open quantum optical systems 
and has been applied to not only continuous quantum measurement [24, 25], but 
also quantum control (see [26] for an overview and introduction). 

The Ito integral is based upon the assumption that the noise is completely un- 
correlated so that the integral between times [to,ti) will be independent from the 
integral over times [ti,t2) for any times, < to < < h, no matter how small 
the difference. For an actual Brownian particle, this is only an approximation as 
the particle's velocity is correlated for times between atomic collisions. The quan- 
tum ltd integral also makes such an assumption but it does so by assuming that 
the operators representing equivalent integrals commute for nonoverlapping times, 
no matter how small. Before immediately applying Hudson and Parthasarathy's 
formalism to the laser probed system. Chap. 2 investigates for what times such an 
approximation applies, given that the resulting operators must also be consistent 
with a quasi-monochromatic description of a traveling wave light field. 

The similarities between classical filtering and a quantum system subject to an 
indirect measurement should not be ignored. In a classical setting one seeks an es- 
timate of an unobserved system state consistent with a noisy measurement. The 
fundamental goal in a quantum system is to predict the results of future measure- 
ments consistently and accurately with past measurements. The unobserved atomic 
system is then the estimated quantity and the measured probe gives the noisy data. 
To fully exploit this similarity and apply the techniques developed for classical sys- 
tems, several fundamental questions about the nature of quantum statistics and 
classical probability must be addressed. 

The development of a continuous-time filter for a quantum system was pioneered 
by the work of Belavkin starting in the early 80's [24, 27-29]. These mathematically 
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rigorous results developed and applied a deep relation between the algebraic and 
commutative properties of operators on a Hilbert space and the expression of clas- 
sical stochastic processes. Experimental observations in elementary quantum theory 
are represented by Hermitian operators and that upon making a measurement the 
random outcome corresponds to an eigenvalue of that operator. The connection is 
then made though the following two insights. The first insight is then to associate 
operators with random variables. In classical probability all random variables are 
consistent, in the sense that all random variables will agree that the same under- 
lying outcome of the system occurred, no matter what order they are queried. For 
quantum systems it is a hard learned fact that only commuting operators will return 
consistent results. Thus, the second insight is that in order to use a sequence of 
measurements for statistical inference, all of the measured operators must commute. 
Additionally, any operator whose statistics we wish to infer must also commute with 
all measurements to date. The utility of considering sets of commuting operators for 
the purposes of statistical inference is more well known in the physics community as 
the defining property of QND [30]. Working within these limitations, the problems of 
noncommutativity is no longer an issue leading to a real and useful mapping between 
quantum measurements and classical probabilities. 

A mapping between the quantum and the classical descriptions of probability 
can be more than just a guiding principle. Through a formal isomorphism between 
commuting quantum operators and the language of classical filtering theory all of 
the above classical results can be easily applied. The quantum filter developed by 
Belavkin is nothing more than a noncommuting analog of the classical Kushner- 
Stratonovich equation of nonlinear filtering [25]. Chap. 3 reviews how a mapping 
between quantum operators and a formal classical probability model is made. The 
purpose of this review is two fold. The first is to provide the necessary background for 
a quantum filter. The second is to shed light on how the algebraic language of classical 
probability theory can be applied to quantum systems thereby gaining new insights 
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and intuitions into the quantum/classical divide. When the quantum and classical 
coincide, nearly 50 years of engineering experience can either be immediately applied 
or adapted with some modifications. Finally, this chapter shows how the quantum 
filter is equivalent to a generalized measurement by making a unitary extension to a 
larger dimensional Hilbert space. 

Chap. 4 applies one such method to the quantum system of an identical spin 
ensemble undergoing a polarimetry measurement in idealized conditions. Brigo et al. 
applied the methods of differential geometry to simplify a classical filter [31, 32]. 
This method of making differential projections was adapted to a quantum system by 
van Handel and Mabuchi where they simplified a continuous quantum measurement 
of a strongly driven atom-cavity system into manifold states where the cavity has a 
Gaussian Q-function [33] . The method has been subsequently applied to other cavity 
QED systems [34-36], collective spin systems in a hnearized-Gaussian regime [37], 
and to find a low rank approximation to a general master equations in Lindblad form 
[38]. Chap. 4 computes the orthogonal projection of an ensemble of n qubits into 
the manifold of identical separable states of the form p^'^ and numerically compares 
the accuracy of such an approximation to a complete evolution. 

Using the projected filter as a computational tool. Chap. 5 turns the problem 
of quantum state tomography essentially into a classical parameter estimation prob- 
lem, where the classical parameters are the pointing angles of a spin coherent state 
constructed from the initial n qubits. The parameters are estimated by numerically 
computing a maximum likelihood estimate based upon a polarimetry measurement 
when the measurement statistics are strongly affected by quantum backaction. By 
only including the conditional effects present in the projection filter we achieve an 
average reconstruction fidelity that nearly saturates an optimum bound given by any 
generalized measurement scheme [39]. 
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1.0.1 A note on quantum foundations 

Any work that addresses the quantum world and in particular quantum measurement 
eventually encounters some issue rooted in the foundations of quantum mechanics 
and the various interpretations one could assume. This dissertation does not address 
quantum mechanical foundations in any meaningful way and attempts to remain 
agnostic about the reality of a quantum state or even the existence of a more fun- 
damental theory. Wherever possible we take a statistical perspective and implicitly 
assume that the simple models we construct may not be error free, in the sense that 
they do not include the whole the reality of a given experiment. 

When considering quantum state tomography, we compare the conditional evo- 
lution of an ensemble of initial conditions and then select the state that maximizes 
a likelihood function. In our numerical simulations, no member of that ensemble 
corresponds with arbitrary precision to the initial condition used to simulate the 
measurement record. So in one sense the conditional state we calculate will always 
be incorrect. However, in a field where the ontology of a quantum state is still de- 
bated, we take a conservative position and will not to presume to know that any 
conditional state is the true conditional state. Instead we will only take the stance 
that what we calculate is a quantum state that best predicts any future measurement 
in a manner that is consistent with past results and the assumptions of the model. 
We identify this state through the framework of quantum probability theory, a for- 
malism that is less well known to physicists working in quantum information theory. 
The final object that we calculate is ultimately no different from what is given by 
the usual stochastic Schrodinger/master equations that are used in quantum optics. 

The purpose of working with quantum probability theory is that it illustrates an 
immediate connection to classical probability and estimation theory. In the classical 
setting, an estimator is a tool that is used to predict or estimate some quantity given 
a series of measurements. The stochastic Schrodinger equation is in a very real sense 
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a quantum estimator. We would rather not comment as to whether or not it is 
estimating the state of the system because it lacks knowledge of a theory extending 
beyond standard quantum mechanics or if it is the fundamental limit and there is 
no more information in existences. In effect, we assume that the quantum state is 
simply a tool for making predictions about a quantum system. 



1.1 An executive summary 



This is a terse summary of the fundamental results of this dissertation, presented 
in the same order as the subsequent chapters. This is not intended to be a gentle 
introduction to the material and assumes a strong familiarity with the background 
material. We encourage an interested but nonexpert reader not to struggle too 
hard trying to comprehend this section and instead consult the main text and the 
associated appendices. 

If any single global thesis can be applied to the entirety of this work it is that clas- 
sical probabilistic methods are useful and that with some care they can be adapted 
to quantum systems. The previous introduction discussed how to connect stochastic 
calculus and nonlinear filtering theory to a quantum system continuously probed by 
an optical field. Chaps. 2 and 3 provide a physical and mathematical foundation for 
this connection while Chaps. 4 and 5 apply it to the specific problem of efficiently 
estimating an initial qubit state. Chap. 6 discusses possible directions this work 
could take. In addition to this main matter, we include several appendices providing 
background material such as a review of paraxial optics (Appendix A), stochastic 
differential equations (Appendix B), quantum stochastic differential equations (Ap- 
pendix C) and the quantum Wong-Zakai theorem (Appendix D). 
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1.1.1 Quantum optics and quantum stochastic differential 
equations 

Chap. 2 shows how a second quantized picture of classical traveling wave packets 
reproduces the mathematical structure necessary for defining a formal quantum Ito 
stochastic calculus. It also identifies the timescales for which a quasi- monochromatic 
field can be approximated as generating quantum white noise. This is a regime that 
is independent from any system coupling or measurement apparatus and applies for 
a large family of states - include highly nonclassical states, such as multi-mode Fock 
states. 

The specific model we consider is the second quantization of quasi-monochromatic 
wave packets [40-42] where the single particle Hilbert space is the space of coherent 
state amplitudes for an associated classical field. We assume a paraxial model where 
there is a factorization between a carrier plane wave ex.p{—iuJo{t — z/c)), spatial 
mode function uip \x, y, z), and longitudinal envelope function f(t — z/c). The quasi- 
monochromatic approximation means that the longitudinal function must satisfy the 



We ultimately seek creation and annihilation operators that are simultaneously 
quasi-monochromatic as well as consistent with a quantum white noise approxima- 
tion. To do so, we define a^[f (0)] to be the operator that creates a single quantum in 
a given spatial mode u^\x,y, z), with an envelope function /, referenced to some 
point along the optical axis. The operator a[f (i)] annihilates a quantum in a similar 
mode but one that has experienced free propagation for a time t. We derive the 
unequal time commutation relation. 



a[f(ti)], at[f(t2)]] oce--°(*-*i) (/*/ {t, - t^) - t-f^ ^ f {t^ - t,)) (1.1) 



where g ~k f is the cross-correlation function of g and / and the proportionality 
constant is simply a scaling factor that can be absorbed into the definition of /. 



inequality > 



— I — 
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Physically it is entirely reasonable that if a classical envelope is no longer temporally 
correlated then the associated field operators should commute. To the best of our 
knowledge this is a new result in the characterization of quantized fields. 

The canonical definition for quantum white noise is that there exist the creation 
and annihilation operators [a{t) , a'' {t')] oc 6{t — t'). Therefore in order for quasi- 
monochromatic light to be consistent with a white noise approximation, not only 
does / ★ / (^2 - ^i) S{t2 - ti) in a suitable limit but * / (^2 - ^i) ^ 0. 

From an approximation to white noise, (in a rotating frame) we then use the 
limiting white noise operators and a recent theorem by Gough [43], reviewed in 
Appendix D, to consider the dispersive Faraday interaction, in an idealized regime 
where the possibility for multiple scattering events is nonnegligible. The limiting 
object is a quantum stochastic Ito equation for the propagator that describes the 
unitary evolution between the field and the atomic system. Using well know results 
in quantum stochastics we write down the equivalent master equation in Lindblad 
form. 

1.1.2 Classical and quantum probability theory 

Chap. 3 is a mathematical review of well known results from classical and quantum 
probability theory, which serves as a foundation for the novel work in later chapters. 
This review is conducted with an emphasis for physicists and attempts to explain 
and justify the concepts while omitting the proofs. The end goal is have an under- 
standing of how the conditional master equation results from a mapping between 
sets of commuting operators and a classical probability space. This is a critical point 
as the driving noise in the conditional master equation is not a Wiener process, but 
is instead the random outcomes of a continuous quantum limited measurement. The 
resulting classical stochastic process {yt}t>Q is only a Brownian motion when the 
measurements are (i) of a field quadrature in the vacuum state and (ii) there is no 
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system coupling to that quadrature, i.e. the measurement has no system information. 

The second objective of this chapter is to emphasize the general power of this 
technique and to discuss how the language of classical probability theory can be used 
to identify semiclassical subspaces embedded in a quantum system. In order to do 
so in a relatively self-contained manner we review the basic elements in the triple 
(QjJ*, P) forming a classical probability space. The infinite dimensional example 
we focus on is the sample paths for a Brownian motion and explicitly describe the 
relevant cr-algebra. In order to introduce the quantum conditional expectation, we 
first review the classical conditional expectation and more generally how expectation 
values are computed in the measure theoretic framework. We then introduce the 
concept of time-adapted processes and martingales as both are crucial in the quantum 
case. The review of classical probability theory concludes by discussing the Wiener 
process and the Wiener measure over the space of continuous functions. 

Prom a firm description of classical probability theory we then discuss the quan- 
tum analog. We explicitly show how one identifies a classical probability space from 
the set of mutually commuting observables by taking Q as the set of possible eigenval- 
ues, J-" as the (j-algcbra generated by those eigenvalues, and the probability measure 
P as the quantum expectation under the state p of the associated projectors. From 
this semiclassical description we then introduce the noncommutative analog were 
one omits a sample space of compatible outcomes, identifies the cr-algebra with a *- 
algebra of operators (or a von Neumann algebra in the infinite dimensional case) , and 
the probability measure with a valid quantum state p. We then explain the power of 
generating sub-*-algebras from sets of operators focusing on the important object of 
the commutant. We specifically explain how the commutant is the largest space of 
operators that we can condition on a sequence of commuting observations and how it 
contains noncommuting elements. Armed with that description we identify the prop- 
erties of the quantum conditional expectation, and provide an explicit construction 
for how it is in correspondence with the generalized measurements found in quantum 
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information theory. From the discussion of the quantum conditional expectation we 
then state the rcsuhing the quantum filter as it is generated from the observation 



While the quantum filter is an elegant expression for a conditional operator, it 
rarely closes to a finite set of quantum stochastic differential equations. Rather it is 
more useful to work with an effectively semiclassical equation, the conditional master 
equation. Here we use the term semiclassical in a sense that does not imply a subop- 
timal approximation but rather to indicate that the quantum measurement process 
{^t}t>o family of operators) is treated as a classical stochastic process {yt}t>o 
family of classical random variables defined on the probability space {fl, J^, P) ). The 
probability measure P matches the statistics of {yt}t>o to the quantum measurement 
statistics {Yt}^^^^. While this mapping "demotes" the measurement operators to a 
classical process, it still treats the system quantum mechanically, by propagating a 
density operator {pt}t>o- Generally the statistics of {yt}t>o ^^^^ depend upon a quan- 
tum system expectation value, and so this is semiclassical and not a fully classical 
probability model. This system density operator matches the quantum conditional 
expectation by enforcing the equality 



for every system operator X and time t. 

The quantum filter is derived in terms of a quantum Ito equation and so the 
resulting semiclassical conditional master equation is a matrix-valued classical Ito 
equation. In Chap. 4 we are required to express it in terms of a Stratonovich 
integral and so we derive the associated correction factor here. The chapter closes 
by finding a conditional Schrodinger equation that corresponds to the more general 
master equation in the case of pure states. This equation is useful for numerical 
simulation as propagating a complex vector is more efficient than a complex matrix. 




under vacuum expectation. 



(1.2) 
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1.1.3 Projection filtering for qubit ensembles 

Chap. 4 derives an approximate form of the conditional master equation for an 
ensemble of n qubits under the assumption that the state will remain nearly an 
identical separable state. The approximation is made though a technique known as 
projection filtering, developed to reduce the dimension of a classical filtering equation 
by formulating the space of solutions as a Riemannian manifold and then making 
an orthogonal projection onto a lower dimensional manifold. The lower dimensional 
manifold that we wish to project onto is the space of density matrices that can be 
written as g = p®" for some vahd single qubit state p. The appeal of the projection 
filtering technique is that it is algorithmic in nature, in that after identifying the 
desired manifold and making a choice of metric, finding the optimal projection is 
reduced to a problem of matrix algebra. Due to the simplicity of the qubit we are 
able to solve for this projection analytically. 

A third of this chapter reviews the fundamentals of differential geometry, focusing 
on the mapping between qubit states and the Bloch ball. We refer to the set of 
valid quantum states for a d < oo dimensional quantum system as S{d) and the 
three-dimensional unit ball as B. The metric we use is the trace inner product 
{A, B) ^ = Tt[A'^B) for A, B E TgS{d). From the standard mapping between points 
in the Bloch ball and qubit states p : B CR^ ^ S{2) 

p(x) = 1(1+ x'ai) (1.3) 

we identify a basis {-Dj = ^cTi} for the tangent space Tp(x)5(2). The resulting trace 
inner product induces an Euclidean metric on B, 

(A,^,), = iTr(a,a,) = i%. (1.4) 

The manifold we ultimately want to consider is the set of density operators 



V = {p(x)®" : X e B} c 5(2"). 



(1.5) 
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Any derivative we define for ^ G P must distribute over tfie tensor product structure, 
and so we identify tlie tangent space 

T,(x)P = span I A(x) = ^^(x)^^-^ ® ^a, ® p(x)®'*-^| . (1.6) 

For n 7^ 1, the metric on the Bloch ball induced from the trace inner product is no 
longer Euclidean. Instead it is given by the matrix 

^,,(x)=Tr(A(x)Z^,(x)) 

( 1^ (1-7) 

= ^ (1 + 1^1 ) + \n (1 + |X| ) x'^xHuhr 

This metric is however isotropic, which can be seen by converting to spherical coor- 
dinates. The resulting line element is 

= 1^ (1 + ry-^ (yt? "^^^ + "^^^ + ''''' ^ '^'^O • ^^-^^ 

With this non-Euclidean metric, we then wish to apply the projection map Hp : 
T^Si^T) TqV, defined as 

np(X)=5^^(x)(D,(x),X)^ A(x) (1.9) 

to each terms in the conditional master equation. 

A general unconditioned master equation written in Lindblad form is, 

y^-i{E,Q\-rV{L\{Q) (1.10) 

for some Hamiltonian H and jump operator L. As the master equation describes a 
valid quantum evolution, the righthand side of this equation must describe a vector 
in the tangent space T'g<S(2"). Applying the projector II-p to the general master 
equation results in a new master equation, describing the evolution of a modified 
state 



=^^^-(x)((i^,(x), -i\E, ^(x)])^ + (i^,.(x), V\L\{q))^ A(x). (1.11) 
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This new equation is guaranteed to both produce a vahd quantum evolution as well 
as constrain the state to remain in V. Performing this projection in the case of a 
conditional master equation is essentially no different, with one caveat due to the 
subtle nature of stochastic integrations. 

Converting the derivative ^g\r into a differential form dp\p has no ambiguity in 
interpretation as the differential 



describes a valid mapping between the tangent space Tf]R+ and the tangent space 
Tq[-x_)P. However, interpreting a general stochastic differential in terms of a differen- 
tial form is problematic because the explicit path-wise derivative generally does not 
exist. Even if one were to solve the stochastic differential equation 



for -B(xt) e Tg^xt)'P there is no a priori reason to assume that the resulting solution 
will remain in V. In developing the projection filtering technique, Brigo et al. found 
that a solution to a general Ito equation decidedly does not satisfy this property 
[31]. The problem is that the drift induced by the second order nature of the Ito 
rule causes the solution to leave V even when the integrand is in the proper tangent 
space. The saving grace is that the orthogonal projection method does constrain the 
solution when the original equation is written in Stratonovich form. The bottom line 
is that in order to project the conditional master equation into the tangent space 
T^V it must first be written as a Stratonovich integral. Chap. 3 calculates the 
proper conversion factor, generating the Ito correction map Ic[L]{g) for a general 
measurement operator L. 

The conditional master equation is given by the Stratonovich equation 



dg = -i[i/tot, g]dt + V[Ltot]{g)dt + Ic[Ltot]{g)dt + H[Ltot\ig) o dvt (1.14) 



dg\p = a*(x) Di{x) dt 



(1.12) 



dg\-p = S(xt) dwt 



(1.13) 
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where the maps X>[Ltot](-)) ^[-^K")) Xc[L](-) are given in eqs. (4.5, 4.6, and 4.10) 
respectively. The subscript tot used here is used to specify that these operators act 
on the joint Hilbert space over all n qubits. 

The projections of each term are computed relatively generally, but under the 
assumption that the operators iftot and Ltot act identically and independently on 
each qubit in the ensemble. This means that for the single qubit operators H and L 
the joint operators are equal to 

n 

Htot^^l^^'^^H^l'^''-^ (1.15) 
e=i 

and 

n 

e=i 

Prom the general expressions we also specialize to the examples of L = y/n \(7z and 
H — ^{f^{t)ax + P{t)(Jy + p{t)az) for a constant rate k and deterministic real valued 
control fields /*(t). This specialized example corresponds to an idealized model of 
a dispersive measurement of a collective angular momenta and a time varying but 
uniform magnetic field. The final expression we calculate for this example and call 
the projection filter is a system of coupled Ito stochastic differential equations that 
correspond to the single particle Bloch vector components, Xj, 

dxt — a^(x, t) dt — y/KX z dvt, 

dyt — a^{x.,t) dt — y/Hy zdvt, (1-17) 
dzt = a^(x, t) dt + \/k(1 ~ ^^) dvt- 

The deterministic integrands a*(x, i) are 

a\^,t) = f{t) z - f{t)y - \kx + K^{r)x z\ 

a2(x, t) = f{t) x-f\t)z- \ny + n-i{r) y z\ (1.18) 
a3(x, t) = f\t) y - fit) x-n{n-l) (i^) z - k^{t) z' 



Chapter 1. Introduction 



20 



with the function 

Finally the stochastic increment dvt is the innovation process, calculated from the 
measurement process (no relation to the Bloch vector component) with a differ- 
ential 

dvf — dyt — n^/nzt dt. (1-20) 



The nonlinear function 7(r) has two important zeros that simplify the projection 
filter dramatically. The first is that when n = 1, 7(r) = for every value of r. 
Furthermore it is easy to compute that when evaluating the projection filter for 
n — 1, the equations are identical to a set of conditional Bloch vector equations one 
obtains directly from the conditional master equation. In other words, the projected 
space is the whole manifold of solutions, V — S{2). The second zero occurs for 
7(r = 1) = for any n. The n and r dependent terms in a^(x, also evaluate to 
zero for r = 1, meaning that for any n the projected pure state evolution is essentially 
identical to the evolution of a single qubit state. The only remaining n dependence is 
that the innovation process requires the expected measurement outcome to be scaled 
by a factor of n. 

There are three elements that makes this projection filter tractable for obtaining 
an analytic expression. The first is the isotropic nature of trace inner product metric, 
dramatically simplifying the calculation. The second is that the Pauli matrices form 
a simple basis for 2 x 2 complex matrices and have equal eigenvalues. The third 
is the identical and independent assumption for the joint operators. This allows for 
terms that would in general result in ensemble averages to be given by identical single 
particle values. 

The final element of Chap. 4 is a series of numerical experiments testing the 
accuracy and performance of the projection filter against exact simulations for initial 
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pure spin coherent states. The rate k sets the measurement timescale and so all 
times in the simulations arc compared to this rate, effectively setting it to 1. Each 
simulation ran for a fixed time t = 0.2 k~^. As the quality of the projection filter 
should explicitly depend upon n, these simulations test the qubit numbers n — 
{1,25,50,75,100}. The average performance data included a sample oi u — 100 
isotropically sampled initial qubit states with a single noise realization for each initial 
state. 

In addition to testing the performance as a function of n, it also tests two different 
control functions /*(t). The first is for /*(t) = for all t and i. This corresponds to a 
QND measurement of the z projection of the total angular momentum formed by the 
qubit ensemble, J^, and is known to produce spin squeezing. We find that an initial 
state involving 50 qubits prepared in +Jx eigenstate produced ~ 10 dB of squeezing 
in one measurement duration. A squeezed state is inherently not a product state, 
and so serves as a worst case scenario for the projection filter and acts as a lower 
bound on its performance. 

The zero field measurement is compared to the case of a strong randomized 
control sequence. Chap. 5 uses the projection filter in an algorithm to reconstruct 
the initial condition of a SCS from a continuous measurement of J^, characterized 
by the rate k. In order to obtain information about observables other than Jz, an 
external control Hamiltonian must be applied. For reasons discussed in Sec. 5.3.1, 
this takes form of a sequence of global 7r/2 rotations, where each rotation is about 
an axis n independently sampled from a uniform distribution. Fully characterizing 
the control amplitude f{t) requires specifying the amplitude and duration of each 
pulse, as a larger Larmor frequency is needed to enact the same rotation in a shorter 
time. For simplicity, we will fix f{t) to have a constant magnitude and so for a pulse 
duration r the control field is then given by. 





m=l 
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where X[a,b) {t) is the indicator function for the interval [a,b) and {rim} are i.i.d. 
unit vectors drawn from a isotropic distribution. 

The accuracy of the projection filter was tested by comparing how well it is 
capable of reproducing the conditional expectation values of the collective angular 
momentum components Jj as well as the squared overlap between the exact state and 
the equivalent spin coherent state that is made from an ensemble of n identical pure 
qubits. The time-dependent results are presented in Figures 4.4 and 4.5. The RMS 
error in the conditional expectation values were generally independent of the number 
of qubits, likely due to the fact that for pure states the projection filter dynamics 
are essentially independent of n. With the randomized controls the RMS error was 
< 5% of the total spin length in all 3 expectation values. In the absence of a control 
field, there was a general linear increase in the Jx and Jy errors also reaching the 
5% level, while there was a noticeable increase in the error with a final value in 
the ~ 5 — 10% range. The poorer performance is attributable to the effect the spin 
squeezing has on the mean values. 

The squared overlap between the exact state and the equivalent spin coherent 
state exhibits a strong dependence upon both the number of qubits and the con- 
trol fields. In the uncontrolled case this metric monotonic decreased for all n > 1 
dropping to 0.75 for n = 25 and 0.48 for n = 100. This is in stark contrast to the 
simulations including the randomized controls. While the resulting average fidelity 
was noticeably poorer for large n. the minimum value was > 0.8 for all n. While the 
trend was to have poorer fidelities at longer times, the decrease was not monotonic 
implying that the control wave form could be optimized to maximize the average 
overlap with the spin coherent state and thereby minimizing the information lost by 
performing the the projection. 

We hypothesize that the state remains closer to a product state because the 
randomized controls tends to mix both the squeezed and anti-squeezed components 
leading to a near zero average. Not only does the mean spin rotate, but the orien- 
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tation of the squeezing ellipse also rotates. As the rotation axes are chosen from a 
uniform distribution, the squeezed component is just as likely as the anti-squeezed 
component to be oriented along the measurement axis. At any given time, the uncer- 
tainty in the component is equally likely to be above or below the uncertainty of an 
equivalent spin coherent state. Therefore it is difficult for any significant squeezing 
to develop, and thereby keep the exact state closer to the product state description. 

1.1.4 Qubit state reconstruction 

Chap. 5 describes how to use the quantum filtering formalism in order to construct 
a tomographic estimate for an unknown initial quantum state from an ensemble of 
identical copies experiencing a joint continuous measurement. We make a maximum 
likehhood estimate (MLE) of the initial state, based upon the statistics of a single 
continuous measurement realization. The purpose of this work is to extend previous 
results [11-13], which used a continuous measurement for quantum state tomogra- 
phy, into a regime where the quantum backaction significantly effect the measurement 
statistics. In an idealized numerical study, we find that such an estimate can nearly 
saturate an optimum reconstruction bound. Much is known about the fundamental 
quantum limits of reconstructing pure qubit states from a finite number of measure- 
ments. Massar and Popcscu showed that given n copies of a pure qubit state, it is 
possible to find a generalized measurement that returns the highest average fidelity 
between the estimated state and the correct initial state [39]. The average is made 
not just over measurement outcomes but also over an unbiased set of possible input 
states. The optimum average fidelity bound is simply given {T)opt — {n + l)/{n + 2). 

We consider here an idealized model of an ensemble of n qubits identically coupled 
to a single traveling wave quantum light field via a linearized Faraday interaction. 
Under certain approximations discussed in Sec. 2.7, a measurement of the orthogonal 
quadrature contains information about the collective angular momentum variable 
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Jz, with a coupling rate k. The ensemble is assumed to be prepared in a pure spin 
coherent state characterized by the unknown polar angles {6, 0). However the initial 
qubit state is not a QND variable, meaning that p{9, 0)®" does not commute with the 
fundamental interaction Hamiltonian. The implication of this is that it is impossible 
to find a consistent method for inverting the forward time dynamic to arrive at a 
conditional expression for the initial state. 

To circumvent this problem we instead map the quantum state estimation prob- 
lem to a parameter estimation problem to find a MLE of {9, (j)). A single continuous 
measurement realization of a noncommuting output quadrature results in a stochas- 
tic process {yt}t>o ^^^^ contains information about the atomic ensemble. Because 
of this information, its statistics parametrically depend upon the unknown angles. 
While a MLE based upon a single data point would perform quite poorly, we find a 
conditional estimate based upon the entire trajectory performs quite well, when the 
measurement is informationally complete. To ensure informational completeness, a 
known time-varying control Hamiltonian is applied to the system, thereby mixing 
all spin projections with the measurement axis. Riofrio et al. found that an efficient 
and unbiased control policy is to choose a set of operations capable of generating 
any single particle state and then randomly varying the magnitude of each control 
in a piecewise constant way [13]. Here we use a similar control policy by including 
in the modal a uniform magnetic field with a constant field strength that rotates the 
collective angular momentum vector by 7r/2 in a period r about randomly chosen 
rotation axes. In a fixed final time we find 40 rotations provides enough information 
to obtain high fidelity estimates. 

In the semiclassical probability space induced by a measurement realization, the 
appropriate probability measure P has a parametric dependence on the initial angles 
{0,(f)). Identifying this dependence is best seen by considering not the conditional 
statistics of {l/t}^>Q but instead the calculated innovation process {vt}^^^- For the 
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measurement model considered here this process is given by 

vt^yt-2y/^ [ dsTr{J,psie,(f))), (1.22) 
Jo 

where ps{0,(l)) is the system density operator calculated via the conditional master 
equation assuming that initial condition is given by the angles {0,(f)). The inno- 
vation process Vt is shown in Sec. 3.4.1 to have the statistics of a Wiener process 
if Tr{Jz Ps{0,(p)) corresponds to the exact quantum conditional expectation of the 
Heisenberg picture operator U^JzUf. If this correspondence cannot be made because 
Ps{d, 0) used an incorrect initial condition, then Vt will not have statistics of a Wiener 
process for every measurement realization. Here we use this fact to find the MLE for 
{9, 4>). We seek the initial condition that makes the innovation process most likely to 
be a Wiener process. As the conditional master equation is a nonlinear equation, we 
resort to a mixture of numerical and analytical methods for finding an approximation 
to the true likelihood function. 

The Wiener measure gives the probability for a Wiener process sampled at times 
{ti : i — 1, . . . ,n} will be within associated intervals /j = (oj, 6j) and is given by the 
integral 

pa.. . m ^ £■ M... f n (-^^) ) . (1.23) 

Because this is a Gaussian probability density, the MLE coincides with the least 
squares estimate. For an equally spaced mesh of finite time intervals both the nor- 
malization factor and the denominator of the exponent are irrelevant for the purposes 
of computing a MLE. Therefore maximizing the likelihood function is equivalent to 
minimizing the quadratic variation, 

n 

QV(vt) = J2 {^yi-2V^^tTr(J,ptU0, (1.24) 

i=l 

The minimization of this function with respect {9,(f)) is computed numerically as we 
are unable to find an analytic solution to the conditional master equation and the 
dependence upon the initial condition is nonlinear. 
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Every evaluation of this function requires a full numerical integration of the con- 
ditional master equation. Numerically integrating an exact conditional pure state is 
a computationally intensive task. Here we test spin ensemble involving 25 < n < 100 
qubits, which require order n complex number to fully describe the relevant condi- 
tional dynamics. While it is computationally feasible to integrate the exact equation 
to generate a simulated measurement record, for every measurement record we used 
a total of 500 evaluations of the quadratic variation cost function. We found it 
infeasible to use an exact expression for computing QV(t't) and instead sought a 
reasonably accurate approximation. The approximation we use is the projection fil- 
ter developed in Chap. 4. Under identical conditions to the dynamics used here, 
the projection filter is able to match the expectation value Tic{Jz pt) to within 95% 
accuracy of the exact value while only propagating 3 real numbers for any number 
of qubits. By minimizing with respect to the approximate filter, the limiting com- 
putational element became generating the simulated measurement record. While in 
a higher dimensional space one could approach the problem with a gradient descent 
algorithm, we find it more efficient to simply make a dense Monte Carlo sampling of 
the entire Bloch sphere^ and then select the most likely sample. 

In order to understand what role backaction plays in limiting the reconstruction 
fidelity, we compare the performance of the projection filter estimate to one that 
ignores completely the conditional dynamics. Instead of propagating a conditional 
state, this estimate only considers the Hamiltonian dynamics generated by the mag- 
netic field rotations. In other words by solving the Heisenberg equation of motion. 



for the controls p{t) given in Eq. (1.21), the expectation value ^/K, nTi{az{t)p{9 , (f))) 
reproduces the expected signal, ignoring the backaction. The purpose for computing 

^Due an issue involving numerical stability we start with a uniform sampling of mixed 
states and then make a subsequently smaller sample of pure states. See Sec. 5.4.2. 



dt 
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this in the Heisenberg picture is so that we are able to solve for the dynamical 
observables once, and then apply that solution for any initial condition. 

The results of the numerical experiments are given in Fig. 5.2. The estimate 
based upon the projection filter nearly achieves the optimum (n + l)/(n + 2) fidelity 
bound, averaged over u = 1000 trials for n = (25,40,55,70,85,100) qubits. The 
difference between the optimum bound and the numerical averages never exceeded 
0.21%, a deviation that is hkely statistically significant but not attributable to any 
fundamental Monte Carlo sampling errors. In comparison the backaction-free es- 
timator performed significantly poorer especially for higher qubit numbers. This 
suggest that including the conditional dynamics is indeed important in this idealized 
scenario. 

The cause of the discrepancy between the projection filter estimate and the 
backaction-free estimate is likely due to a bias that develops when all measurement 
effects arc ignored. This can be see in Figures 5.3 and 5.4. These figures plot the av- 
erage reconstruction fidelity as a function of the measurement duration. The filtering 
based estimate shows a monotonic rise in the average fidelity which then saturates at 
a level only slightly below the optimum bound. As n increases this saturation occurs 
at earlier times. In contrast, the estimate based only on Hamiltonian evolution does 
not have monotonic increase in reconstruction fidelity. For n = 55, 70, 85, 100 the 
average fidelity reaches a maximum and then has significant decrease as more data 
arc collected. For n = 25, 40 it is possible that a decrease might also have occurred 
if the simulation continued for longer times. 

When backaction is ignored, the assumption of pure unitary evolution implies that 
no coherence is lost during the course of the measurement. If at time t the random- 
ized controls managed to return to the original orientation then the backaction-free 
estimator would "weight" the data received at that time just as much as the data 
obtained at time t — 0. In comparison, the filtering based estimate knows that while 
the rotations may have canceled, the expected signal at time t is not what it was 
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at time t = 0, precisely because of the conditional effects. By not including this 
information the unitary estimate is biased away from the optimum estimate. 
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Chapter 2 

Quantum Optics and Quantum 
Stochastic Differential Equations 

The objective of this chapter is to identify how the formahsm of quantum stochas- 
tic differential equations is implemented in the context of quantum optics. This is 
done by first showing how the second quantization of classical quasi-monochromatic 
traveling wave packets gives the natural structure necessary for defining the quan- 
tum Ito integral. We then show under what conditions a wave packet operator can 
be treated as generating a localized field operator, which is necessary for a Markov 
approximation. Prom that localized structure, we then review how this defines a 
quantum Wiener process and relates to the quantum white noise formalism usually 
presented in quantum optics. With a wave packet description of quantum white 
noise, we then review how a system coupling to these operators generates a quantum 
stochastic differential equation for the propagator. Generating this equation is inti- 
mately related to the operator ordering of the field operators, which is also related 
to defining different kinds of stochastic equations. Here we review this fact and how 
the propagator is derived. Finally, we apply this result to the Faraday Hamiltonian 
and discuss the interaction in the limits of both strong and weak number coupling. 
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2.1 Quantum Stochastic Process in Optical Fields 

The classical stochastic process is most generally defined as a family of random vari- 
ables {xt}f^Q indexed by time t. A quantum stochastic process is then a family of 
operators {Xt}^^^ also indexed by time. This definition allows for a slightly more 
general structure than simply an operator is dependent upon time. A common ex- 
ample of a time-dependent quantum operator is a Heisenberg picture operator, X[t), 
acting on some Hilbert space H with its dynamics given by a unitary transformation. 
Conversely, a quantum stochastic process implies something more general, where the 
spectrum of Xt could be time-dependent and even the Hilbert space upon which it 
acts nontrivially could be continuously changing in time. 

A concrete and pertinent example is a continuous wave laser that is switched on 
at time to — and then switched off at some later time t > 0. Consider a stationary 
observer located a distance d = cr away from the laser who begins counting photons 
with a perfectly efficient detector at time to- By a time < s < r it is clear that 
this observer will have not observed any of the laser light and so, in absence of any 
corrupting background, the probability for observing anything must be zero. The 
point of this example is that in the time interval [0, s] any observations must be 
modeled as projectors acting on a part of Fock space that is independent of the part 
displaced by the laser. 

It is perfectly reasonable to use a Schrodinger picture description where the ob- 
server makes projective measurements on a volume of the electromagnetic field and 
the free-field Hamiltonian acts in such a way as to propagate the state of the field 
fixed from the laser position to the detector. In Sec. 3.2, a mapping between a set 
of commuting observables and a classical probability space is developed, where there 
is a one-to-one correspondence between classical random variables and commuting 
operators. To utilize these tools of classical probabihty theory, it is most natural to 
work in a Heisenberg picture where the states remain fixed and the unitary evolution 
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is applied to the observables of interest. To develop a Heisenberg picture formula- 
tion for a continuous measurement, we require a mathematical structure that can 
cope with the fact that as time progresses a stationary observer will measure an ever 
increasing set of operators, and these operators act upon different parts of the field's 
Hilbert space. This family of operators is the quintessential definition of a quantum 
stochastic process. 

In classical stochastic calculus, the Wiener process is the fundamental random 
process from which the Ito integral is constructed and from there other processes are 
defined. In the quantum setting, we require equally fundamental operator-valued 
noise processes from which we will construct other processes. But as we are seek- 
ing a description of a continuous optical measurement, those processes should arise 
from the quantized electromagnetic field. The next section reviews the canonical 
quantization of the free electromagnetic field to identify the operator nature of the 
quantized field. 

Throughout this chapter we will be discussing both classical and quantized ele- 
ments of the electromagnetic field. In order to make this distinction, the classical 
vector fields for the vector potential, electric field, etc. will be denoted as ^(x, t), 
5(x, t) and their quantized operator expressions as A(x, t), £^(x, t). We will quan- 
tize the free space electromagnetic field following the classic text by Cohen- Tannoudji 
et al., and use SI units [44]. For reference, the spatial Fourier transform of a function 
/(x, t) is defined as 




(2.1) 



and the inverse transform is 




(2.2) 
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2.1.1 Free space quantization 

When rigorously quantizing the free space electromagnetic field, one begins by defin- 
ing a scalar Lagrangian functional, with respect to variations in the vector potential, 
A.{x,t), whose minimization reproduces Maxwell's equations [44]. The field Hamil- 
tonian is 



Kf^ [ d'x ( ^ + leoc' |V X Af I (2.3) 
with the conjugate variable to the vector potential 11 being 

U^So^^A^-eo£. (2.4) 

(The final equality with the electric field is made by assuming that there are no free 
charge.) Prom this classical Hamiltonian, the connection with quantum mechanics 
is made by noting this is the Hamiltonian of a continuous set of harmonic oscillators 
with canonical variables {.4., 11}. Quantization then promotes these variables to 
canonically commuting operators. 

By choosing to work in the Coulomb gauge, it is easily shown that if v4(k, t) is 
the Fourier transform of A then 

k-A = 0. (2.5) 

This constraint results in two free polarization components (labeled by s e {1,2}) 
for each Fourier component, defining the vectors eg(k) which satisfy the properties 

k-eg(k) = 0, (2.6a) 
e;{k)-e,,{k)^Sg,,, and (2.6b) 

where i, j refer to the Cartesian components. In terms of the real space operators the 
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components of the quantized fields A(x) and -E(x) satisfy the commutation relations, 
[A(x),A,(x')] = [£;,(x),£;,(x')] = (2.7) 



[yl,(x), -SoEji^')] = th6ij{^ - x') (2.8) 
where Sfj^x. — x') is the transverse delta function, defined as 

4(x - / (gs e--'-"') (S, - J^) . (2.9) 

In reciprocal space, the vector potential is most suitably expressed in terms of the 
annihilation (^^(k)) and creation (a^(k)) operators associated with the polarization 
vectors eq(k). They obey the commutation relations, 

[a,(k), a,,(k')] = [at(k), a;,(k')] = and (2.10) 
[a,(k),a;,(k')] = V^(k-k'). (2.11) 

The Schrodinger pictures operators for the vector potential and electric field are then 
given by 



AW = i:/(575^1J^^""'«:(k)a,(k)+A.c., (2.12) 



Substituting these expressions into the Hamiltonian results in the simplified form 

Hf = ^hcY^ I d^k |k| (ag(k)aj(k) + a;(k)aq(k)) . (2.15) 
q 

We will follow the standard practice of discarding any vacuum energy contributions 
and write 

Hf^hcJ2 J |k| a5(k)ag(k). (2.16) 
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2.2 Wave Packets, Fock Space and Stochastic 
Processes 



For a single simple harmonic oscillator, the state space is spanned by a complete 
basis of states labeling the number of quanta in the oscillator. In the free-space EM 
field, we have instead a continuous distribution of oscillators, each representing a 
plane wave Fourier component with one of two orthogonal polarization states. This 
continuous nature means that the field operators are unbounded in two ways: for a 
given k we can have a countably infinite number of quanta and a single quantum 
can have an unbounded amount of energy if we allow pure plane wave states with 
arbitral large wave numbers |k|. The solution to these problems is to consider only 
states of light for which these operators return finite quantities. 

Notice that in Eqs. (2.12 - 2.16) the plane wave operators ag(k) and aj(k) act 
as operator-valued integral kernels where they are combined with various weighting 
functions to form the physically relevant operators. The fact that they have the 
singular commutation relation [oq(k), a^,(k')] = Sgg/ S(k — k') implies that they are 
only well defined in the context of an integral, where the Dirac delta function is well 
behaved. The point is that the domain of the operators E, B and Hf that return finite 
eigenvalues should not be considered as a set of distinct plane wave oscillators, but 
instead in terms of continuous functions defined over ranges of Fourier components. 
We will refer to these distributions as wave packets, in that by constructing a properly 
weighted distribution over plane waves one arrives with a localized pulse or packet 
of waves that propagates in some direction. Rather than initially discussing wave 
packets in terms of single quanta, it is easier to first define wave packet states in 
terms of semiclassical states of hght that generalize the coherent state of a single 
mode harmonic oscillator. 
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2.2.1 Wave packets 

A semiclassical wave packet identifies those states of liglit tliat reproduce coherent 
classical radiation when one takes expectation values of the quantized operators 
A(x), E(x), etc. This relationship has been identified by many authors, e.g. Deutsch 
[40], Garrison and Chiao [41], Smith and Raymer [42]. We review these results here, 
focusing on the physical interpretation for the wave packet distributions. 

The single-mode coherent state, = |q;), is characterized by the complex ampli- 
tude a, where the mean photon number is given by |q;| and is an eigenstate of the 
annihilation operator a jo;) = a\a). We have seen that in the canonical quantization 
of the free field, each plane wave and transverse polarization vector has its own anni- 
hilation operator aq(k) and so for a corresponding coherent state of light requires a 
complex vector valued function g(k). The coherent state '^[g] satisfies the equation 



By hypothesizing the existence of the states iplg] we would like to see how the 
coherent amplitude function g(k) relates to physical quantities in expectation. By 
taking the expectation value of the (vacuum energy removed) Hamiltonian Eq. (2.16) 
we can easily see that 




(2.17) 




(2.18) 



An equally trivial calculation results in 




(2.19) 



and 




(2.20) 
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It is possible to invert these two equations and so express g(k) in terms of the spatial 
Fourier transform of a classical vector potential, ^(k, t). Performing this inversion 
we find that, 



g(k) = X I (c |k| A{k, 0) + i ^Mk, t) 



2;ic|k| V dt 



(2.21) 



t=o> 

When quantizing the field, Eq. (2.21) and its adjoint are nothing more than the 
"normal variables" that are in classical correspondence to aq{k) and a^g{k) [44]. 

It is common in optics to relate the physical classical fields A., S, and B to a 
unitless mode function. In terms of the vector potential this results in the ansatz, 

A{x, t)^Ao{ ti(+) (x, t) + u^-^ (x, t) ) (2.22) 

where Aq is a real constant and u^~^\'x,t) is a complex unit-less mode function. The 
fact that the vector potential is required to be real, we have the relation that 

u(-)(x,t) =u(+)*(x,t). (2.23) 

As u^+)(x, t) is unitless, its integral 

V ^ j £x \u^+\x,t)\^ (2.24) 

has units of volume and is referred to as the mode volume of the field. By taking 
the spatial Fourier transform of Eq. (2.22) we have 

A{k, t) = Ao (k, t) + (k, t)) (2.25) 

The purpose of separating between u*^"'")(x, t) and u(~)(x, t) is to allow for the sepa- 
ration between positive and negative frequency components, respectively. For a free 
field then, 

u^+\k,t) = u(+)(k,0) e-'^l^l*. (2.26) 
The Fourier space version of Eq. 2.1.1 is 

u(-)(k,t) = u(+)*(-k,t). (2.27) 
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To simplify the expression for g(k) as given in Eq. (2.21), we need to compute the 
time derivative of A{k,t). Substituting Eq. (2.26) into Eq. (2.25) and computing 
the derivative we have 

§^A{Kt) = -tc\k\Ao (S(+Hk,t) - G^+^*(-k,t)) . (2.28) 

Substituting this expression into Eq. (2.21) leads to 



g(k,t)=Ay^^5W(k,t). (2.29) 

Rather than including the vector potential constant which usually contains in- 
formation about the overall intensity of the field, it is useful to relate it back to the 
magnitude of g. We first define the characteristic wave number ki as 

/■ , , , |G(+)(k,0)|' , , 

h= d^k |k| J \ ■ (2-30) 

By considering v~^ |u^+^(k, 0)|^ to be a normahzed distribution in reciprocal space, 
then ki is the average magnitude. With this definition 

ur=Ai^-^. (2.31) 

Inverting this relationship results in 

g(k,t) = ||g|i#^^e-*'. (2.32) 

V ki y/V 



It is worth noting that the units of Eq. (2.32) is of root volume and that ||g|| 
now acts as a unitless scaling factor. This final formula shows the fundamental 
relationship between a distribution over coherent state amplitudes g(k) and the 
positive frequency Fourier components of the mode function u^+)(k, 0). While in one 
sense this has simply been an algebraic exercise (expressing one distribution over 
spatial frequencies in terms of another) the real utility of this expression is that the 
mode function u^'^\x,t) has practical imphcations as it describes the spatial and 
temporal properties of a propagating laser beam. 
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Finally, we express the expected energy in a wave packet state in terms of the 
envelope function. Simply substituting Eq. (2.32) into Eq. (2.18) results in, 

Similarly to defining the mean wave vector ki we can define a two-norm wave vector 

, r , , ,2 |G(+)(k,o)|'V , , 

k2= { (i^k |k|' J (2.34) 



so that 

= M\' (2.35) 

If, however, |u*^+^(k, 0)|^ is a sharply peaked function centered at some large vector 
ko, then we have that |ko| ~ /ci ~ A;2. In this case the average energy is then 

where ujq = c |ko|. 



2.2.2 Weyl operators 

Assuming the existence of the semiclassical states is only a first step, but real utihty 
comes from finding the family of operator that generate these states. In the context 
of the simple harmonic oscillators, the coherent state with amplitude a is generated 
by the unitary displacement operator 

Dsho{<^) = exp (aa^ — a* a) with \a) = Dsho{ci) |0). (2.37) 

Writing (2.37) in terms of its generator T{a) 

Aho(«) = exp(-^T(a)) (2.38) 
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we find that 

T(a)^i{aa^ -a*d) . (2.39) 

Note that as g'g(k) is a distribution of coherent amplitudes over all plane wave modes, 
we make the correspondence 

a*a ^ g;{k)ag{k). (2.40) 

But as this is a pointwise weighting over each plane wave, we define the total field 

operators a[g] and a^[g] to be 

d[g] = J2 I c?'M:(k)a,(k) (2.41) 

and 

«^[g]^E / d'kg,{k)dl{k). (2.42) 

This are sometimes called smeared creation and annihilation operators as they have 
been spread over a range of k values. By applying the commutation relations (2.10), 
it is easy to see that 

[d[i],d%]\ = J d'kr{k)-g{k). (2.43) 

An important property that we will use is that by the linearity of the integral over 
d^k we have that, for complex coefficients ci and C2 

ci d^[f] + 02 d^[g] = d^[cj + C2g] (2.44) 

and 

Cia[f] + C2a[g] =a[ctf + c;g]. (2.45) 

In other words a^[-] is linear in its argument and d[-] is anti-linear. The continuous 
analog of the displacement operator, called a Weyl operator, is 

W[g]=exp(at[g]-a[g]) (2.46) 
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and the coherent state ip[g] is given by 



^[g]=W[g]|0). 



(2.47) 



Applying the Zassenhaus formula to the Weyl operator shows that 



W[g] = exp(aT[g]) exp(a[g]) exp 



( 




(2.48) 



Note that because a[g] |0) = for any g, this implies that 



ijj[g] = exp 




(2.49) 



When proving limits involving sequences of coherent states, it is often more conve- 
nient to work with unnormalized state vectors. Therefore, it is common to define an 
exponential vector 



One particularly useful relationship that we will end up applying repeatedly is that 



i.e., ip[g] is an eigenstate of any smeared annihilation operator a[f], regardless of the 
smearing function f . If, however, the functions f and g are orthogonal, then that 
eigenvalue could very well be zero. 

2.2.3 Fock space 

A number of useful relations can be derived involving the Weyl displacement opera- 
tors and the exponential vectors. Before doing so it is necessary to introduce some 
of the formal and algebraic properties of second quantization and Fock spaces. Note 
that if we take f and g to be any square integrable complex functions, then the right 



e[g] = exp(aT[g]) |0). 



(2.50) 




(2.51) 
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hand side of (2.43) forms an inner product on a Hilbert space of wave packets. We 
will denote the inner product as, 

(g, f) = J] / d'k g;{k)f,{k) (2.52) 

and the Hilbert space of wave packets as 

M{g(k) : (g, g)<oo}. (2.53) 



A Fock space ^ is a total Hilbert space describing a unknown and possibly 
unbounded number of particles that are each represented by states in a single particle 
Hilbert space 1^. If we are given a single particle from f), then the full Hilbert space 
of two such particles is the tensor product of two such Hilbert spaces. Likewise for 
three particles, there would be three fold product. We will notate the joint space of 
n particle as P)®" = 1) §>> 1) ® ■ ■ ■ ® I) where there are n such products. In terms of a 
total space with an indeterminant number of particles, each subspace that contains 
n particle will be mutually orthogonal. Thus the total space is the direct sum over 
each subspace. If we take the space for zero particles to be the complex numbers, 
— C), then the full Fock space is given by 

oo 

=^fuii(f)) = (2.54) 

n=0 

The reason for the notation =^fuii(f)) is that if f) is a Hilbert space of bosonic particles 
than only states that are symmetric under particle exchange will apply. We denote 
the symmetric subspace of f)®" to be f)®''". So the symmetric Fock space is given by 

oo 

^sym(f)) = f)^^". (2.55) 
n=0 

Wc arc strictly interested in bosonic particles, so throughout this document we when 
refer to =^(-) we are referring to the symmetric Fock space. 

For a single simple harmonic oscillator, the coherent state \a) is expressed in 
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terms of the number states \n) as 




(2.56) 



The equivalent expression for the wave packet state ■^[f] is 



oo 




^[f] = e-ll^ll^/^0 



(2.57) 



n=0 



From the relation that (f®", g®"^ = ^f^ g^"^ -y^^e have 



(^[f]|^[g]) 



exp(-i(||f|r + ||g|r) + (f, g)) 



(2.58) 



or equivalently 



(e[f]|e[g]) 



e{f,g>. 



(2.59) 



A number of useful properties involving the Weyl displacement operators and the 
exponential vectors are the following: 

• The Weyl operators obey the composition law 



• The linear span of all the exponential vectors (and equivalently the coherent 
states) is dense in the symmetric Fock space ^(f)), meaning that any state 
in t^([)) can be represented by a limiting sequence of a linear combination of 
exponential vectors [45]. 

• Written in terms of the single particle inner product, the exponential vector 
e[g] is an eigenvector of the annihilation operator a[f] with, 



W[g] W[f] = exp (-i((g, f) - (f, g)) W[g + f]. 



(2.60) 



The action of the Weyl operator on an exponential vector is 



W[g]c[f] = e-<S'f>-llsll'/'e[f + g]. 



(2.61) 



«[f]e[g] = (f, g) e[g]. 



(2.62) 
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2.2.4 A basis independent expression for the wave packet 
inner product 

An alternative to expressing ||g(k, in terms of the characteristic parameters, 
V, ki, etc. is to relate it to a basis independent expression involving the physical 
(classical) fields £ and A.. Prom Eq. (2.29) we can see that 



g(k,t)= ^?^^W(k,t) (2.63) 
and that 

l|g(k,t)ir = | 



- J rf^^A; c|k| ^W*(k,t) ■^W(k,t). (2.64) 



The presence of the factor c |k| makes this expression inherently tied to the k basis 
and not immediately expressible in terms real space quantities. However by recog- 
nizing that in the Coulomb gauge £ = —■§^A. and ^'^"'"^(k, t) = A'^~^\'k,0)e~^'^^^^^ we 
have the equality 

c\k\A'^+\k,t) = -iS^+\k,t). (2.65) 

Substituting this relation into Eq. (2.64), 

||g(k,t)||2 = '^Jd'k £W*(k,t) •^W(k,t). (2.66) 

This expression is basis independent, in the sense that we can take the inverse trans- 
forms to arrive at 



|2 



^2^0 

h 



J d^x £(+)*(x,t) •^W(x,t). (2.67) 



In [42] , Smith and Raymer derive a Dirac quantization scheme for a photon wave 
function, equivalent to the more standard expressions reviewed in Sec. 2.1.1. In that 
work they assume that for each polarization vector q there exists a countable set 
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of complete scalar orthonormal wave packets {^'^^(k)}, which therefore satisfy the 
properties 



/ 



5;]^,,(k)*^,,(k') = (5(k-k') and 

j 



(2.68) 



They then observe that the classical electric fields |£^j^''(x, t)| in correspondence to 
these wave packets, via Eq. (2.20), are no-longcr orthogonal in a real space overlap 
integral precisely because of the weighting factor of \/|k|, 



J d'x £j+^*(x,t) 4+;(x,t) ^ 5,,. V- (2-69) 

They also observe that if instead one considers the overlap with the vector potential 
then the orthogonality is preserved, due to the cancelation of the factors of \/|k|. 
This is precisely the statement that if (gj(k, i), gj/(k, i)) = Sjj', then 

(g,(k,t), g,,(k,t)) = jd'x Sf^*{^,t) ■ Af\^,t) 

(2.70) 

= ^ / * (x, t ) • £ (x, t) = S,,, . 

In quantizing a photon's wave function Smith and Raymer consider £^'^\'x.,t) to be 
the fundamental single particle wave functions. Secondly they observe that in order 

to preserve orthogonality in the real space inner product then the dual vectors are 
not £^~^^*{'x,t) but are instead proportional to A.^~^^*{'x,t). 

In this work we will continue to view the single particle vectors to be the wave 
packet functions g and not the associated classical electric field. This is for two 
reasons. Firstly, it is mathematically convenient that the vector dual to the wave 
packet g is simply its complex conjugate. The second reason is that we continue 
to treat g as an analogy with the simple harmonic oscillator's coherent state and 
the vector potential A and the electric field £ are in correspondence with X and P 
quadratures. 
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2.2.5 Fock space and stochastic srocesses 

By structuring the Hilbert space of the free EM field as a Fock space defined over 
a single particle Hilbert space [), we can now define a quantum stochastic processes 
and a quantum stochastic calculus. Consider again the example of a coherent laser 
pulse propagating towards a photon counter. A quantum description of a traveling 
wave laser pulse is a coherent wave packet state ■0[g] where g(k, t) is related to the 
classical field by Eq.(2.29). Imagine a perfect space fixed detector that is capable 
of returning a voltage directly proportional to the total energy in a given classical 
wave packet g(k, t). Furthermore, imagine that this detector is activated between 
the times [io,^i], and after this interval the voltage is read. If the "entirety" of an 
incident wave packet g, could be absorbed in that time, then the detector should be 
modeled as making a projective measurement on the part of Fock space containing 
il^[g]. Depending upon the details of the detector, it could likely have recorded 
pulses that were similar enough to g, either in magnitude, temporal profile or carrier 
frequency. For instance, a 100% efficient detector with a linear response should be 
able to measure pulses with 2 /iW of average laser power just as well as a pulse with 
200 mW of power. By modeling a physical measurement as a Hermitian operator 
acting upon some Fock space, we need to define the set of possible wave packets 
the detector could have completely measured. Fig. 2.1 shows a schematic where a 
paraxial laser pulse is focused upon a gated photo-detector. 

In the second quantization formalism we can give a mathematical chain from a 
classical wave packets to field operators. In the time interval [to,ti] a fixed detector 
could projectively measure some set {g} of incident wave packets and the linear 
span of these wave packets forms a subspace i)ito,ti] C f). In turn i)[to,ti] defines 
a subspace '^{i)[to,ti]) C ^{i))- Furthermore there exist operators O[to,ti] that act 
nontrivially on coherent states ■0[g] £ =^(^)[to,ti]) but as the identity for any ■0[g"'"] for 
g"*" ^ f)[to,t]- An operator ^[to,ti] identified by this procedure then defines a quantum 
stochastic process, by considering the family of operators {-'^[to,*] : to '^t < oo}. The 
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Paraxial Measurement Model 




Figure 2.1: A Model of a Paraxial Measurement. A photo-detector is positioned 
relative to an optical system which defines a paraxial beam with a characteristic 
wavelength and mode profile. Here the optical system is defined simply by a 
focusing lens and color filter with the beam schematically indicated by lines 
of constant intensity. The detector is activated between times to and ti which 
corresponds, at time t = to a pulse localized in space in the region Az = c{ti — 
to). For a perfect detector with a linear response, the integrated output current 
will be proportional to the total pulse energy and can be modeled as making a 
projective measurement on the sub-Fock space of pulses localized between these 
times. 
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requirement that X[tQ,t] acts trivially on coherent states defines a process that is 

time- adapted, in direct analogy with a time-adapted classical stochastic process (see 
Sec. 3.1.1 for the definition of a time-adapted classical stochastic process). Turning 
this qualitative procedure into a mathematically sound object requires explicitly 
constructing {)[to,ti], which means defining what it means to measure the entirety 
of a wave packet in a finite time interval. While naively this may seem trivial, in 
practice it intersects with the problem of defining a localizable photon in quantum 
field theory. We will illustrate why this is an issue next. 

2.2.6 Localized wave packets and stochastic processes 

While the canonical quantization of the free field is most easily performed in the 
Fourier domain, the mathematical structure of the second quantized Fock space 
^{i)) is generally basis independent. The operators a[g] and a^[f] can be related to 
the coherent states ip[h] without any reference to the fact that the wave packets are 
originally defined with respect to k. Any unitary transformation of g is an equally 
valid expression of the wave packet state in that the Hilbert space of wave packets 
i) — {g '■ (g, g) < oo} is basis independent. The only element that depends upon 
g being defined in the Fourier domain is its relationship to the spatial profile of the 
mode function u^~^\x,t). But as we have defined the wave packets in the Fourier 

1 1 2 

domain, it is not immediately apparent what effect the constraint ||g|| < oo has on 
w*^+)(x, t). One drastic result of this constraint is that it prohibits one from defining 
fields that are strictly locahzed in space [41, 46]. 

To see why this is true, consider a one-dimensional case where we wish to define 
square wave pulse of duration L, with a carrier frequency ujq — cko. The mode 
function for such a pulse is 



"loc (^'^) =^lo,L] - ct) exp{+iko{z - at)). 



(2.71) 
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Taking the spatial Fourier transform shows that 

u[^J{k,t) = sine {lL{k - ko)) exp ( - - ko) L - ickt) (2.72) 
V 27r 

and from Eq. (2.29), we then have gioc{k,t) oc Vku[^^ {k,t). The Vk factor makes 
all the difference, as if we try and calculate the norm we find 

/oo 
dk \k\ sinc2(|L(A; - ko)) = oo. (2.73) 
-oo 

The failure of this calculation stems from the fact that the indicator function X[a,6] is 
a discontinuous function and this discontinuity presents itself in the Fourier domain 
by this divergence. This example implies that there is a nonlocalizable property of 
photon wave packets. This nonlocal property of a photon wave function has been 
studied by many authors, with various definitions for a photon's wave function, see 
[42] for a pertinent discussion. While this example does not show that any locahzed 
wave packet suffers from this or a similar problem, this is indeed the case. In [46], 
Bialynicki-Birula proves that the energy density of a photon can be localized no 
better than an exponential function, exp(— /(r)), where /(r) grows slightly slower 
than a linear function in r. 

At this point, we must make some approximation thereby admitting localized 
states of light. Ultimately this means that we will relax the measurement window 
to include functions localized with in exponentially damped tails. However, we gain 
physical insight by considering a temporal rescaling so that on a timescale that is 
long compared to an optical period, a smoothly varying function can appear to be 
a localized discontinuous function. We will also show that though this rescaling, 
one obtains all the familiar approximations in quantum optics, namely the Markov, 
quasi-monochromatic, and rotating wave approximations. It also provides a gateway 
for defining quantum white noise and equivalently the necessary conditions for ap- 
plying a quantum Wong-Zakai theorem to arrive at a physical realization of quantum 
stochastic calculus. 
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2.3 Paraxial Envelopes and Measurable Pulses 

Before constructing temporally localized wave packet we first must address the spa- 
tial/temporal decomposition indicated in Fig. 2.1. An excellent mathematical de- 
scription for the focused collection of light by a series of thin lenses is to model 
the system paraxially where a coherent plane wave is propagating along the optical 
axis but is spatially and temporally modulated by a slowly varying envelope func- 
tion. This envelope function describes how the plane wave is localized to the optical 
axis as well as how the phase fronts are distorted by the optical system [47, 48]. 
Appendix A reviews the derivation of the paraxial wave equation, which describes 
the propagation of a slowly- vary ing-envelope, as well as computes its spatial Fourier 
transform. 

A paraxial and quasi-monochromatic wave is characterized by the complex func- 
tion 

= /(^.)4^^(xT,-^)e-^'^°*^ (2.74) 

Here is the axis of propagation, x^- is the remaining transverse coordinates and 
tr — t — z/cis the retarded time. The paraxial mode function u^^(xr, z) describes 
how the carrier wave (with angular frequency uq, and wave number ko) is modulated 
as it propagates along the optical axis. Note that this is a time independent quantity. 
Any nontrivial time dependence is given by the temporal envelope function /(tr), 
which decouples from the paraxial mode The only requirement is that f{tr) 

be slowly varying, |^/| <^ coq |/|, ensuring the full solution is quasi-monochromatic. 
The problem of finding the space of measurable wave packets now translates into 
finding the temporal envelopes {/(tr)} that "fit" in the measurement window [to, ti]- 

Identifying the appropriate spatial mode function uip\xT, z) for a given optical 
system, is simply a problem of classical optics and is well modeled by a Hermite- 
Gaussian mode function [48]. Here we are only concerned with the fact that such 
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a function exists and is well defined and has a given "transverse area". In free 
space conservation of energy requires that the total power passing though a plane 
transverse to the optical axis be conserved, which manifests thought the property 
that for any paraxial mode we have 

2 



d xt 



4+)(kT,;2) =aT (2.75) 



and that (Tt is independent of z for any finite z. While the distribution of energy 
in the transverse plane can vary due to diffraction, the total power passing though 
an infinite transverse plane will be conserved. This transverse area can be combined 
with the square integrated temporal duration 

j dt \f{t)\\ (2.76) 

to construct the total mode volume 

V = CT (Tt- (2.77) 

We have already shown how a wave packet g is related to the spatial Fourier 
transform of a classical vector potential, ^^+)(k, t), and how that can be expressed 
in terms of a unitless mode function u(+)(x, t). The spatial Fourier transform of the 
paraxial mode function is given by 

WW(k,i) = c/(a;(k) -a;o) 4+'(kT, 0) e'^'^^'^)*. (2.78) 

where / is the temporal Fourier transform of the pulse envelope, u^''(kT,0) is the 
spatial transform of the mode function with respect to the transverse coordinates xt 
(evaluated at 2; = 0) and C(;(k) is the approximate frequency 

|2 

2ko 



a;(k) = c |k| fti c I + kA . (2.79) 



Eq. (2.29) relates g(k,t) to ^(+)(k,t) and so 

g(k, t) = A y^^^c/(a;(k) - u,) aW(k^) e--W*. (2.80) 
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In Sec. 2.2.1, we eliminated the constant Ao in favor of an expression in terms 
of characteristic parameters, namely the mode volume v and the characteristic wave 
number ki. Here we abandon ki in favor of a frequency oui — cki, which is equal to 

2 



d^k u{k) of {u{k) - Uo) u^t\^t) 



(2.81) 



Calculating this integral is easiest with the change of variables 
{kr, kz} — >■ {kr, i^(k)}, we find that 



UJi 



d^kj 



T 



4^^(kT) 

2 



(2.82) 



^ as 



Implicit in the paraxial approximation, is the requirement that 
jkrl^ — >■ oo. This fall off imphes that we can treat the factor c Ik^l^/ 2A;o in ^^^(k) 
as a finite and independent offset to the du^k) integral and not consider how a;(k) 
converges as A;^ — )■ — oo with Ik^l ^ oo. We can then make another change of 
variables a;(k) ^ + o'o so that 



^kj 



4^HkT) 



T 



when /(t) is real-valued, it is simple to show that j{y) 
therefore mean zero. In that case we have 



(2.83) 



is an even function and 



(2.84) 



will 



(2.85) 



as one would intuitively expect. In the case of a complex valued f{t), f{i') 
not in general be mean zero. In this more general case. 

For the one-dimensional localized pulse. Sec. 2.2.6 demonstrated that the second 
integral is infinite. By integral expressions for the Fourier transforms and then inte- 
grating by parts, we can show that 



/dv V 



I 



dv 



V 



dt 



1 

cjo \dt 



nt) fit). 



(2.86) 



Chapter 2. Quantum Optics and Quantum Stochastic Differential Equations 52 



By the slowly varying envelope approximation, we require that |;f /(^)| -C cuq \f{t)\, 
and in order for the quasi-monochromatic regime to hold this integral must be a 
small correction factor. Therefore, we find that 




g(k, t) ^ ||g|| J / (a;(k) - coo) uS^^(kT, 0) e-^C^)*. (2.87) 



2.3.1 Paraxial wave packets in the time domain 

Even in a quasi-monochromatic regime, the wave packet g is still tied the Fourier 



basis due to the factor of ^yu(k). We just showed that when f{t) is real- valued or 

1 1 2 

very slowly varying then this factor plays no role in calculating ||g|| . Here we would 
like express the inner product between wave packets that share the same spatial mode 
in terms of real space coordinates in order observe what effect this "nonlocal" factor 
has on their temporal distinguishably. If we are able to make the approximation 

that w(k) ^ uq for a family of wave packets then there will be a simple unitary 
relationship between wave packets the real and Fourier domains. The goal of this 
section is to identify this family. 

Consider the two wave packets gi(k, t) and g2(k, t) that share the same paraxial 
mode function uip \'Kj', z) and carrier frequencies, but have differing temporal profiles 
fi{tr) and f2{tr)- For simplicity we will assume that the corrective factor of Eq. (2.86) 
is small and so Eq. (2.87) is valid for each wave packet. If we then calculate the 
unequal time inner product, we have that 

(gl(k,t0,g2(k,t2))= I'^^I'I'^^II 



[ d'k 4+)(kT, 0) ' Jl (a;(k) - a;o) h H^) - ^o) e--^'^)^*-*^). 

(2.88) 



By again making the change of variables {k^, k^} {k^, u} with i/ — a;(k) — ujq we 
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are able to integrate out the transverse degrees of freedom to arrive at 

{g,{Kt,),g,{Kh))=^^^jM fdu^^^ (2.89) 

In analogy with the convolution theorem, it is easy to show that 

J dvJl{v)h{y)e-'''' = J dsms)Ms + t) = f^^h{t) (2.90) 

where /i*/2 (t) is the cross-correlation function between /i and /2 evaluated at time 
t. Furthermore by repeating the integration by parts transformation from Eq. (2.86) 
we have that 

J du uJl{i,)Uv)e-^^' = -^^^/2 it). (2.91) 
Combining these two facts, 

(gi(k,tO, g2(k,t2)) = MMe— o(*.-*i) 



/i^/2 ih-t,)-t-^^h (t2-ti)) (2.92) 
ojq dt J 



While previously we were able to show that for zero delay and a real-valued envelope 
the second term would be identically zero, this is clearly not the case for different 
wave packets. However, due to the slowly varying envelope approximation we know 
that this must be a small correction. Ignoring this correction results in 

(gi(k,ii), g2(k,i2)) ~ MlMg-..o(*.-*i) f,^f,{t2-t,). (2.93) 

\/Tir2 

Eq. (2.93) shows that the overlap between the two wave packets is proportional to the 
cross correlation function of the temporal envelopes. Physically this is a extremely 
satisfying result, as if we have the two (paraxial) field operators a[gi(k, fi)] and 
a1'[g2(k,t2)] then 

[a[gi(k,ti)],at[g2(k,t2)]] oc/i^/2(t2-ti) (2.94) 
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meaning that field operators for uncorrelated temporal envelopes commute! Fur- 
thermore if wc can construct a wave packet ip(k,t) whose temporal envelope (p{tr) 
is (approximately) delta correlated in time then, 

[dMk,t)], d^[^ik,t')]] ^6it'-t). (2.95) 

This is significant because this is the defining feature of quantum white noise, which 
is discussed in Sec. 2.6. Before doing so, we will apply the results of this section the 
defining i)[t,,t2]- 



2.3.2 The measurable subspace 

In the ideal situation, the measurable wave packet are the wave packets defined on 
the paraxial mode w^''(xx, z) with a envelope functions f{t) such that f{s) = for 
all s ^ [io)^i]- Unfortunately because of the problem of localization no such physical 
wave packets exist. If we allow for discontinuous functions, then for any function 
9(t), 

9[toM]it)=^[toM]it)9{t) (2.96) 

is clearly zero for any t ^ [toj^i] and therefore would be an element of f)[to,ti]. In 
order to define approximately localized temporal envelopes we need an approximate 
form of the indicator function X[to,ti] (i), i.e. a smooth cut off function. A common 
choice for such a function is to convolve X[to,ti] if) with a smooth positive normalized 
distribution function (/^('^^(t), where a represents the degree of localization. For a 
concrete example, if ^p'^'^^it) is a mean-zero normalized Gaussian with variance 
then. 



V27r(T2 



7iip^^P(-^) (5) (2.97) 



i(erf(^)-erf(^: 
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Note that because hm^j^Q (p^'^\t) = S{t), we also have hmo._>o '^[toti] (^) ~^lUi,ti] (^)- 
Again, to maintain a quasi-monochromatic field f(t) must be slowly varying. This 
statement is quantified by the relation ^ |^| -C |/|, and in terms of a this means 

— < 1. (2.98) 
aujo 

In quantum optical systems a carrier frequency of uq = 271 x 370 THz is not 
uncommon, and has a corresponding wavelength of Aq = 810 nm. A common use for a 
laser at this wavelength is to generate optical pules as short as 5 fs in duration [49] . If 
we take this to be a minimum but physically realizable timescale than we would have 
{a uJo)~^ ~ 0.08. While a wave packet of this duration is still relatively slowly- varying, 
its likely that the correction to ui in Eq. (2.86) could be a nonneghgible contribution, 
as well as other higher order effects. A convenient limit would be to set a such that 
(cTCUo)"^ ~ 10~^, meaning that for the near infrared wavelengths a ~ 0.1 ps. While 
this sets a physically realistic smoothing variance, it does not say when Xj^'^''^^] {t) is 
a good approximation to an actual indicator function, as this requires a comparison 
between the smoothing and its overall duration. Fig. 2.2 illustrates this distinction by 
plotting XjQ^^j (t) with a — 0.1 ps and a series of durations, 0.5 ps < r < 100 ps. For 
visual comparison each indicator is plotted in scaled units of r. Simple inspection 
shows that for intervals on the scale of r > 10 ps a smoothing variance of 0.1 ps 
makes an excellent approximation to the truly discontinuous function. Note that as 
t extends beyond the interval [0, r], Xjq^^] {t) decays like an error function scaled by 
(7 and this extent is independent of r for r ^ a. So that for r = 10 ps and r = 0.1 

With this function in hand we can now identify a space of wave packets that are 
able to be projectively measured approximately in the time window of [to,^i]- For 
any valid envelope function f{t), we can define a localized version 

ftUt)^4Ut)m- (2-99) 

Clearly these functions are good temporal envelopes and approximately fit in f)[tQ,ti]. 
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Figure 2.2: Approximations for a localized pulse. Shown here are a series of 
slowly varying temporal envelopes. Each envelope is a unit pulse centered at zero, 
with a variable duration r, convolved with a Gaussian smoother with a = 0.1 
ps. The pulse duration r ranges between 0.5 ps to 100 ps. Each pulse is plotted 
verses t in units of r. 



Note that we can actually increase the space of valid wave packets by observing that 
with an appropriate a, ^[^1,^^] (t) is itself a valid temporal envelope. Therefore for any 
function f{t) < oo whose support is contained in the interval [to^^i], we can define a 
smooth version via convolution 



(2.100) 
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And so any wave packet whose temporal envelope is defined in this way is approx- 
imately mcasm^able in the time interval [to^^i]- This can be formalized by defining 
the set of functions, 

^[2^, = |(^('^) : J dtfit) < oo, supp(/) C [to,ti]| (2.101) 

and the set of wave packets 



',\:\ , = span ^ g = n^i^o) 4"Hk^,0) ^ ^ 



^0 ^y\tl-to\ 

(2.102) 



We then have the limit 



hoMl - (2.103) 



2.4 The one-dimensional limit 



In the previous sections we showed that if there are two quasi-monochromatic wave 
packets gi and g2 defined on the same paraxial mode but with differing temporal 
envelops /i and /2, then the commutator between a[gi] and a^[g2] is proportional to 
the cross-correlation function /i*/2- Furthermore the proportionality is independent 
of the details of the paraxial mode. This suggests that moving to a simplified, 
one-dimensional model is both appropriate and fruitful. This section makes this 
connection and relates it to the standard representations of quantum white noise. 

The end of Sec. 2.3.1 suggested defining a wave packet (^(k, t) whose temporal 
envelope, (f{tr) is delta correlated in time. In defining the smoothed set of functions 
"^[to^ti]' 2-3.2 took any integrable function defined on the interval [to, ti], /[to,ti](^), 
and convolved it with a Gaussian distribution cp^'^^ (t) to obtain an envelope consistent 
with the quasi-monochromatic approximation. To move to a one-dimensional model. 
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we will factor out f[to,ti]{t) from field operator and define an operator- valued density 
a[ip^'^\t)]. We will shortly show that this is approximately delta commuting in time. 

Deriving this factorization is not difficult, and begins by first noting that as the 
Fourier transform of a convolution is proportional to the product of the Fourier 
transforms, we have that 

f^^\u) = f{u) ^^"^u) = (^J dsf{s)e+"''^ ^^"^H- (2.104) 

Substituting this expression into the definition of g*^'^-*(k, t), as written in Eq. (2.87), 
we have. 



(k,t) = M y f{s)e-'^°' ¥'(")(k,t - s) (2.105) 



where 



ip^''\k,t) = J^^^^W {uj{k)-uJo) HkT, 0) e-'^C')*. (2.106) 

By the anti-linear nature of the creation operator a[g] we are able to bring the integral 
over s out of the operator to write, 

a[g^''\k,t)] jds r{s)e+^°^a[<p^^\t-s)]. (2.107) 

Note that a[g('^)(k, i)] and a[(p'^'^\t — s)] have different units, as the former is unit- 
less while the latter has units of l/Vtime, ultimately arising from the fact that 
(p'-'^\t) is a density over time and therefore has units. Eq. (2.106) has the following 
physical implications. First is that the integral is the point-wise weighting of an 
annihilation operator by a complex amplitude, completely akin to original analogy 
of one-dimensional simple harmonic oscillator a* a -H- f*{t) d{t). The second impli- 
cation is that the complex weighting function in general matches the phase of the 
carrier wave, resulting the explicit appearance of the e+"^°* factor. The third impli- 
cation is that both the function /(s)/^/r and the operator dlcp^'^^t — s)] have the 
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same units, which are the same as white noise. The final imphcation comes from the 
fact that because we have the commutator 

[d[cp^''\t)], a^[ip^''\t')]] = e-''^°(*'-*) ip^''^ ip^''\t' -t) + {{auo)-^) (2.108) 

and the hmit hm^_,o <P^''^ * V^^Kt' -t)^ S{t' - t), then 

'd[g^^\Kt)], d%^\Kt)]\ = MM f ds f*{s) Ms). (2.109) 



hm 

aJoT— >-oo 



This imphes that if we have two square integrable functions, hi{t) and h2{t) then 
these functions can count as members of a single particle Hilbert space, f)' = >C^(R). 
Furthermore we can define a Fock space ^{i)') and ultimately the field operators 
d[hi\ and a[/i2]- If 



hit) - ^/i(t), (2.110) 



then we can draw the formal equivalence 



d[hi] ^ lim a[g^^\ (2.111) 

<T—^0 



While discussing the statistical aspects of quantum light can be interesting in its 
own right, the real fun is when that light is coupled to another quantum system. Sec. 
2.6 shows how when a system couples though an interaction Hamiltonian to operators 
similar to a[c^*^'^)(t)] and d'^[ip^"'\t)], the limiting object can be written in terms of a 
quantum stochastic process on the joint Hilbert space Tisys ® ^(f)')- Furthermore it 
discusses how this relates to the standard expressions in quantum optics involving 
simpler models of quantum white noise. Before including the system however, we 
will show what is gained by taking this discontinuous limit and how it is useful for 
defining quantum stochastic processes. 
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2.5 Quantum Wiener processes and the 
continuous-time decomposition 

Quantum stochastic integrals were first defined mathematically by Hudson and Par- 
thasarathy in 1984. There they formulated a quantum version of a Ito-type stochas- 
tic integral where the fundamental differentials, in correspondence to the classical 
Wiener process and other jump processes, are operators acting on a bosonic Fock 
space [23]. Independently Gardiner and Collett formulated a physical description of 
quantum white noise operators where creation and annihilation operators are asso- 
ciated with excitations in a bosonic heat bath, which are then used as driving noise 
sources in a quantum Langevin equation [50]. This second formulation is the most 
well known in the quantum optics community (see, e.g., the well written reference 
[51]) but is less amenable for directly applying the filtering techniques of classical 
probability theory. The picture of a heat bath does not immediately induce a pic- 
ture of a traveling flow of information from a probe system to a detector. Rather it 
instills a picture of a system immersed in stationary and chaotic environment and it 
is unclear what it means quantum mechanically to "measure the bath" . While one 
certainly could, and often does, construct a large scale flow in the bath running from 
the system to an independent observer such a construction ultimately resembles a 
wave packet description. 

If one instead explicitly includes time into a description of the environment, as 
Hudson and Parthasarathy do, then statistical properties necessary for deflning a 
quantum Wiener process and a quantum Ito integral, namely the ability to construct 
time-adapted processes, is a direct consequence. We will shortly review how this is 
done, but flrst note that in contrast to relying on the system to dictate how the 
bath is modeled, this represents a more axiomatic approach in that the statistical 
properties of the bath are postulated independently from the system. Now clearly the 
physics of the entire system-probe-measurement combination will dictate whether or 
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not this is an appropriate model. The purpose of this section is to show what is 
gained in this formulation. 

Like the Gardiner and CoUett formulation, this formulation begins by assuming 
a bosonic Fock space, however here it assumes that it is a second quantization of a 
single particle Hilbert space, 

f) = £'(M+)®f)' (2.112) 

where >C^(R+) represents the Hilbert space of square integrable functions defined on 
the positive real line (representing time) and f)' is an auxiliary Hilbert space. Almost 
all formulations immediately assume that f)' is a finite d-dimensional system and so 
every g(t) is effectively a complex- vector- valued function, i.e. g : M"*" C^. In this 
case the inner product between two single particle vectors is 

/■oo 

(f, g)= / dtrit)-g{t)<oo. (2.113) 

Prom this single particle Hilbert space the symmetric Fock space =^({)), exponential 
vectors e[g], and Weyl displacement operators W[g] are identical to their definitions 
in Sec. 2.2.3. And most importantly, we can define the annihilation and creation 
operators 

= a[X[o,t] e,] (2.114) 

and 

Al^ = at[X[o,t] e^] (2.115) 
that have the commutation relation 

r 1 

Ai, Ai^ = ds' Xio,s] {s') X[o,t] {s') e, • e,- = Sij min(s, t). (2.116) 

L Jo 

These processes serve as two of the building blocks of quantum stochastic calculus 
and are in analogy with an n-dimensional Wiener process. To show this last analogy, 
we must first discuss how we specifically include time in f). 
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2.5.1 The continuous- time tensor decomposition 

In basis quantum mechanics there is an intimate connection between statistical in- 
dependence and a tensor product structure. When a complete system is described 
by the tensor product of two Hilbert spaces and the total state is a product state 
then both systems can be considered statistically independent. Specifically in this 
case two operators from the individual Hilbert spaces Xi and X2 are statistically 
independent in the sense that 

{X^^X2),^{X^),,{X,)^. (2.117) 

In Sec. 2.2.5 we introduced the notion of a time-adapted quantum stochastic process, 
where a quantum operator 0[o,t] was time-adapted if it acted as the identity on any 
coherent ip[g'^] where is excluded from the time interval [0,t]; g"'"(s) = for 
< s <t. We defined two mutually orthogonal spaces of wave packets [)[o,t] and 
which in turn have their associated Fock spaces =^(i^[o,t]]) and ^{i)-^^^). The classical 
definition of a time-adapted stochastic process is that the process is statistically 
independent of all events in the future. In light of the connection between statistical 
independence on the one hand and a tensor product structure on the other, it seems 
reasonable to have 

=^(f))-=^([)[o,])®^([)fo,]) (2.118) 

where = indicates a unitary equivalence. But if f)[o,t] represents all wave packets 
locahzed to [0,t] then it also seems reasonable to conclude that = f)(t,oo)- 

In this section we will show that this tensor product decomposition is not only 
possible for any single time t but it is also possible for any sequence of n ordered 
times {tn '■ < ti < ■ ■ ■ < tn < 00}. This is called the continuous-time tensor de- 
composition and is the relation that 

^{i)) ^ ^(f)[0,i,)) ® ^(f)[*l,*.)) ® • • • ® nhn,00)). (2.119) 
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The proof of this statement is outhned in lemma 2.1 (which is essentially proposition 
19.6 of [52]). 

Now if Eq. (2.119) is true, then for any partitioning of time, no matter how small, 
this Fock space decomposes into a tensor product between the various partitions. 
Furthermore if we have operators {0[ti,ti+i)} which are each adapted to the interval 
then we have that the expectation values factorize. 



In other words, if both the operators and the state respects the continuous- time 
decomposition then those operators will be statistically independent for independent 
times. 

Why is this important? Well, Sec. 3.1.4 reviews the basic properties of a Wiener 
process and shows how its defining feature is that its restrictions to independent 
time increments are statistically independent and that each are mean zero Gaussian 
random variables of variance ij+i — ti. Therefore, any quantum analog of a Wiener 
process must also respect the continuous-time decomposition. The fact that the 
classical Wiener process satisfies the Markov and martingale properties is a direct 
consequence of this independence [53]. Sec. 3.1.1 reviews the definition of these two 
properties and how they relate to taking conditional expectation values. Additionally, 
the Ito definition of a stochastic integral, (see Appendix B ) is defined in such a way 
so that the integral J Xt dwt is also a martingale and that if Xt is Markovian than so is 
the integral. For a quantum stochastic integral to also have these desirable properties, 
a necessary criteria is that a process {Xf}^^^ must be statistically independent of all 
future events. In the next section we return to the operators Al and A^^ and show 
how they can be used to construct a quantum Wiener process, but first we include 
a proof of the continuous-time decomposition. 

Lemma 2.1. Given the single particle Hilbert space f) = jC'^(R+)®C'^ and an ordered 
sequence of times = {t^ e R"*" : < ti < ■ ■ • < tn < oo}, the symmetric Fock 




(2.120) 
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space satisfies unitarily equivalence ^(f)) = ^([)[o,ti]) ® '^(f)[ti,t2)) ® ■ ■ ■ ® ^([)[t„,oo)) 
where i)lt,,u^,]=C\[ti,ti+,])(^C''. 

Sketch of Proof. For any vector g e f) we can define the projection of g onto a time 
interval via 

moM]{t) =X[toM]it) g{t) (2.121) 
and tliat for all n times we have 

S{t) = g[o,t,){t) + g[t„i,)(t) + ■ ■ ■ + g[t„,oo)(t)- (2.122) 
As this is true for any element f), we have the natural decomposition, 

i) = n[0M) ® n[t,,t,) ® • • • ® ^[t„,oo)- (2.123) 

where l-L[ti,ti+i) is the space of square integrable vector valued functions of dimension 
d defined on the interval [ti, ti+i). Because of this decomposition, proving that ^{i)) 
satisfies the tensor decomposition now means proving unitary equivalence 

n n 

(2-124) 

i=l 1=1 

This is easily shown by first noting that 

(g[ti,ti+i)' ffe.t^+i)) ^ ^^^3 (g[*i,ti+i)' ■ (2.125) 

This however implies that for the exponential vectors e[g] and e[f] 

n 

(e[g]|e[f]) ^ exp((g, f)) = JJexp «g[t„t,+0' huM^.))) ■ (2-126) 

i=i 

If we define the transformation V : ^{i)) — >■ ^([)[o,ti]) (8) • • • (8) ^([)[t„,oo)) such 
that 

^e[g] = e[g[o,i,)] 8) • • • ® e[g[i„,oo)], (2.127) 
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The inner product between two transformed vectors is then 

n 

{V e[g]\V e[i]) = J]^ e^^[*-*i+i)'''[*-*i+i)) = e^^'''^ (2.128) 

This shows that y is a unitary transformation between the exponential vectors. 
However because the exponential vectors are dense in the symmetric Fock space, V 
linearly extends to any vector in □ 



2.5.2 The quantum Wiener process 

In [25], Bouten et al. given an elegant derivation of how the quadratures 

Ql = Al + Al^ and = i (^A^^ - Al) , (2.129) 

have the statistics of a Wiener processes, when the field is in the vacuum state. For 
the sake of completeness we reproduce this derivation here. 

In Sec. 2.2.2, we introduced a[g] and a^[g] though the generators of the coher- 
ent state iplg] through the relation, tp[g] = W[g]|0) = exp(a"''[g] — a[g]) |0). The 
argument of this displacement operator defines a Hermitian generator 

T[g]=^(at[g]-a[g]) (2.130) 

so that exp(a"''[g] — a[g]) = exp(— iT[g]). 
For a classical random variable x then 

) (2.131) 

is the characteristic function for that random variable and therefore characterizes 
its statistics. The Weyl operator exp(— iT[g]) is nearly equivalent to the char- 
acteristic function, up to the constant k and a minus sign. However though the 
anti-linear property of a[g] we have a[Ag] = A*a[g], but if A = —k, (real k) then 
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exp(— iT[— Kg]) = exp(+ifi;T[g]). Converting this operator into a true characteristic 
function simply means taking an expectation value with respect to the field state. If 
the state is in a coherent state, then 

Vr[s]{i^) = (exp(+mT[g]))^[f] , (2.132) 

which characterizes the statistics of the operator T[g]. In terms of the Weyl dis- 
placement operators this means that 

V^T[g](«:) = (^[f]| W[-«:g] |^[f]) = e-llf|l' (e[f] | W[-«:g] |e[f]) . (2.133) 

Eq. (2.61) relates the action of the Weyl operator to the exponential vector showing 
that this simplifies to 

ipriM = exp(- ||f f + ^ (g, f) - ||gf /2) (e[f]|e[-«:g + f]) 

= exp(- ||f 11^ + K (g, f ) - \\g\f /2-K (f , g) + ||f II') (2.134) 
= exp(i K2 Im (g, f) — \\g\f /2). 
The final line is recognizable as the characteristic function of a Gaussian random 
variable of mean 2 Im (g, f) and variance ||g||'. Note that when g =X[o,t)(t) e^, 

T[X[o,)(i)e,]=i(Ajt-4)=p; (2.135) 

and 

T[-i X[o,tM e.] = + 4 = Qi- (2-136) 

1 1 2 

In either case, ||g|| = t and so that both operators have variance t, regardless of the 
coherent amplitude f of the underlying state. 

When the state of the field is in vacuum (f = 0) then both quadratures are 
mean zero, Gaussian random variables whose variance is given by t. Lemma 2.1 also 
shows that the operator T[X[s,t)(t) ej] = — P* is a generator of displacements in 
the Fock space =^(f)[s,t)) and therefore commutes with any generator for states in 
=^(^)[o,s))- Glearly the vacuum respects the continuous tensor product decomposition 
and therefore the quantum stochastic processes {Ql}f^Q and {-P/ }j>o have, in vacuum 
expectation, the statistics of Wiener processes. 
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2.5.3 The units of quantum noise 

An extremely observant reader might be concerned about the quadrature definitions 
of Qt and Pt given in Eq. (2.129). The issue hes in the units and physical interpre- 
tation of the single particle wave functions. In the strictest sense of second quan- 
tization, the normalized vector f e >C^(R+) (8) C'' are single particle wave functions 
whose square represents the probability density for observing the particle at some 
point in its domain. In order for f to be a square normalized density, / (it |f |^ = 1, 
means that f must have units of ^^r^^ - Consequently the operators a[f] and a"''[f] 
must be unitless as their commutator is subsequently unitless. However as written, 
the quadratures Ql and P/ have the commutation relation 

[QlP^]=2it, (2.137) 

which clearly has units of time. The solution to this distinction is realize that when 

defining At we should really be considering the field operators relative to some char- 
acteristic rate 7. Through the linearity of a[f], we clearly have 

«[\/7^m] = 77^*- (2.138) 

The whole point of defining At in this ways is that regardless to the magnitude of 
7 1, the scaled quadrature ^7 Qt will still have the statistics of a Brownian motion, 
simply with the diffusion rate 7. 

The objective of Sec. 2.3 was to identify on what scales we could treat a quasi- 
monochromatic field to be statistically independent for independent increments of 
time. Fig. 2.2 showed the scaling of a smoothed characteristic function XjQ^^j(t) for a 
fixed smoothing variance and a variable duration r. The act of smoothing limited the 
derivative to be at most on the order of l/cr and for r ~ 10^ cr this had little effect on 
the visual appearance of the smoothed function. Note that the actual correction term 
to inner product in Eq. (2.92) compared the rate of change of the temporal envelope 
to the carrier frequency cjq and the introduction of the smoothing distribution is to 
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simply limit this derivative. Assuming that 7 represents the rate of diffusion then 
for times r ~ I/7 any pulse should appear to have a discontinuous derivative on 
this scale. In other words, in order to treat an optical field as generating a quantum 
Wiener process we must have 

7 < (J~^ < coo- (2.139) 

In any realistic application, the physics of the system typically sets values for 7 and 
ujq. In an atomic physics context the carrier frequency is usually a dipole allowed 
optical transition, leading to a;o ~ 27r x 100 THz. For such a transition the measure- 
ment timescale is on the order of the lifetime of the excited state, tdecay ~ 10 ns, 
meaning that typically, 7 ~ 27r x 10 MHz. This leaves 7 orders of magnitude be- 
tween these two scales. In many atomic systems this is actually an upper bound 
on the measurement rate. Typically one will consider off resonant light leading to 
a significantly slower diffusion rate. In term of this dissertation. Chap. 5 applies 
a quantum stochastic treatment to an idealized model of the Faraday interaction, 
where 7 is reduced by a factor of one over this frequency difference. This will be 
discussed in detail in Sec. 2.7. Before considering this specific model we will review 
how to transition from a smooth deterministic Schrodinger equation, to one involving 
quantum stochastic integral with respect to and AI. 



2.6 Systems Interacting with Quantum Noise 

Much of this chapter has alluded to coupling a quantum system of interest to a 
traveling wave field and describing the resulting evolution in terms of a quantum 
stochastic process. In the quantum optics literature a system coupled to delta com- 
muting field operators have been discussed since the work of Gardiner and CoUett, if 
not before [50] . The field operators are typically defined as the Fourier transform of 
one-dimensional operators quantizing a continuous spectrum of harmonic oscillators. 
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That is, from the operators a{uj) and a^{uj), [a{(jj), a'*' (a;')] = S{u! — cu'), 

1 r°° 

a{t) = ^ dw e+'^'^aiuj). (2.140) 
v27r Joo 

It then follows that [a{t),a^{t')] — 6{t' — t). Prom these operators one typically 
formulates the noncommuting quadratures, q(t) = d^(t) + a(t), and p(t) = ia^(t) — 
id{t). If the state of the field is specified to be in vacuum, then {q{t))^ = and 
{q{t)q(t')) ^ = 6{t — t'). In other words, in vacuum expectation q{t) has the statistics 
of white noise and the same is true for p{t) and any rotated combination of the two. 
This is why the field operator d{t) is typically given the designation as quantum 
white noise. 

A system is introduced to the problem typically through a linear interaction 
Hamiltonian where after making a couple of approximations vary much in line with 
our assumed separation of timescale, the interaction Hamiltonian reads [51] 

HintiX, t) = ihy/^ {a}{X, t) c - a(A, t) c^) (2.141) 

where c is a generic system operator and d{X,t) is an operator that limits to d{t) 
as A — > ^. To arriving at Hint{X,t), a transformation to an interaction picture 
was made and all time dependence was associated with either d{X,t) or its adjoint. 
In this interaction picture, the joint state of the system and field evolves under a 
unitary propagator U{X,t), which satisfies the equation 

fuiX,t) = -iHint{X,t)UiX,t). (2.142) 

It is well known that for a general time-dependent Hamiltonian the resulting propa- 
gator is given by a time ordered exponential 

C/(A,t) = fexp (^-^^ dsHi„t{X,s)^ . (2.143) 

^Actually Gardiner and Collctt consider the limit in the frequency domain and so they 
consider a bandwidth 9 that approaches infinity. For our purposes it is more convenient to 
consider A — > 0, but the spirit is the same. 
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The ultimate goal is to interpret the operator Mmx^o U{\,t) = Ut as a solution 
to an equivalent quantum stochastic differential equation. Appendix C reviews the 
mathematical background needed to fully discuss these kinds of equations in the 
language of quantum stochastic processes acting in terms of a second quantized Fock 
space. 

In the textbook formulation of quantum noise, Gardiner and ZoUer define a form 
of quantum Ito calculus that is explicitly tide to the statistical properties of the state 
of the field and considers only a small family of Gaussian states [51]. Conversely, the 
quantum Ito calculus defined by Hudson and Parthasarathy has an Itb rule that is 
independent of the field state and the proof of convergence holds over a large domain 
of possibly correlated system field states [23] . The chief distinction between the two 
formulations is that the quantum optics derivation is based in a specific model, one 
that does not initially assume the structure necessary for the more abstract version. 

Prom the point of view of a physicist trying to model a quantum system starting 
from an interaction Hamiltonian and the Schrodinger equation, it is not at all clear 
how and when that fits into the abstract quantum Ito calculus. While Hudson 
and Parthasarathy gave criteria for when a quantum stochastic differential equation 
describes a unitary process, they did not specify how one should arrive at such a 
process from a Schrodinger equation and an approximating principle. This is exactly 
what the quantum optics derivation provides, however without making the final 
connection to the state independent Ito calculus. In 1990, Accardi et al. proved this 
connection where they showed how a unitary propagator generated from a linear 
Hamiltonian similar to Eq. (2.141) converged to an Ito integral as specified by 
Hudson and Parthasarathy. Additionally they showed that the proof holds in "the 
weak sense of matrix elements." What this means is the following. 

Suppose we are given two coherent states with smoothed wave packets ■0[gi°^^] ^iid 
^^[§2'^''] ^^'i t^^^ when (T — >■ we have the equivalent discontinuous coherent states 
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iplg'i] and '?/^[g2]- Furthermore suppose we are given a stochastic processes < X^'^^ \ 
that contains both system and field operators. Then X^'^^ Xt in the weak sense 
of matrix elements if, for arbitrary system state vectors 0i and 02, 



hm (^i^V^fe^] Xt^ 02 0V'[gr]> = (0i'»V'[g'i]|^t|02 0V'[g2])- (2.144) 



It's worth noting Accardi et al. shows that the limit holds for the bear propa- 
gator U{X,t) as well as for the Heisenberg evolution of a system operator, i.e. 
W{X,t)XU(X,t). 

We mention this here because as long as the total system-field state ptot can be 
represented in terms of the matrix elements 

(^)i 



lini (01 (g) '] ptot 02 i>[g,2 

(7— ^-U 



aLOQ—^oo 

then the quantum stochastic representation is appropriate. Clearly these states are 
much more complex than simply mean-zero Gaussian states as they can represent 
entangled states between the system and the quasi-monochromatic field as well as 
nonclassical field states such as superpositions between modes and even single photon 
states. However not every field state is included, as we must allow for discontinu- 
ous yet quasi- monochromatic matrix elements^. In other words, the total state ptot 
must be compatible with the approximations that make the stochastic representation 
possible to begin with. 

Here we are also interested in moving beyond an interaction Hamiltonian that is 
linear in a^(A,t) and a{X,t). This is because the Faraday interaction is fundamen- 
tally quadratic in the field operators as it describes the scattering of light in one 
polarization state to another, see Sec. 2.7. Fortunately, in 2006 Gough extended the 
results of Accardi et al. to include scattering or conservation interactions. In classical 
stochastic calculus the conversion between a smooth ordinary differential equation 



^For defining a reasonable calculus it is also required that the wave packet amplitudes 
take on a large but finite maximum value. 
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and a stochastic differential equation is tlie Wong-Zakai tlieorem. Not surprisingly, 
the conversion between a smooth Schrodinger equation and quantum stochastic dif- 
ferential equation for the propagator is called the quantum Wong-Zakai theorem in 
[43]. 

The specific Hamiltonian that the quantum Wong-Zakai theorem considers and 
the one that we will use here is the interaction Hamiltonian, (sum over repeated 
indices with i, j — 1 . . . d) 

Hi^t{X, t) = h {Eij d\{\ t) a,(A, t) + a\{\ t) + Eq, a,(A, t) + Eqo) (2.145) 

where {E^p a, ^ = 0, . . . , d} are bounded operators acting on a system Hilbert 
space l-Lsys- Each term in Hint{\t) physically represent the following: 

• Eqq is an operator acting solely on system degrees of freedom, independent of 
the bosonic modes, with units of frequency, e.g. it could be what remains of 
the free system Hamiltonian after transforming to an interaction picture. 

• EiQ is a system operator that accompanies the creation of an excitation in the 
i*'* bosonic mode centered at time t. A canonical example would be an operator 
proportional to an atomic lowering operator, with units of 1/ \/time. 

• Eoj is a complementary process where, at time t, an excitation in the j*'* mode 
is removed. 

• Eij is a unitless system operator weighting an instantaneous scattering of 
quanta from the j*'* mode to the i*^. When i — j this can be interpreted 
as a system coupling to the number of quanta in that mode at time t. 

Note that as this Hamiltonian is required to be self-adjoint, the system operators 
must satisfy the constraint Eai3 = E^j^^. 

Adding the quadratic term has an interesting and slightly unexpected effect on 
the physics. Again the term E^j a|(A, t) dj{\, t) represents the instantaneous transfer 
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of a photon from mode j to mode i. However for A > 0, d\{X,t) has temporal 
extent, meaning that its possible for the system to interact again with the scattered 
quanta. If the magnitude of Ey is relatively small then the possibility of re-interaction 
maybe relatively small, but we have yet to impose any such constraint. Fortunately 
the whole problem of converting from the equation U{X,t) — —^Hint{\t)U{\,t) 
to a quantum stochastic differential equation, including the possibility of multiple 
scattering events was solved by Gough. Some of the details of this derivation is 
reviewed in Appendix D. 

Intimately related to this conversion is how the formal quantum Ito integral is 
related to the operator ordering of the constituent field operators. There exists a 
fundamental connection between the rules of quantum Ito calculus (see Appendix 
C) and whether an iterated integral containing a sequence of field operators are 
either time or normally ordered. This connection was also formalized by Gough in 
[43], which Appendix D reviews. The bottom line is that in order for the limit of 
the time ordered exponential in Eq. (2.143) to be interpreted as a solution to an 
equivalent quantum Ito stochastic integral it must be put into normal order with 
all of the annihilation operators to the right of the creation operators. The effects 
of the multiple scattering events become mathematically apparent when converting 
from the time ordered solution to a normally ordered form. 

Before we arc able to fully write down the resulting propagator we must address 
two important issues. The first is to concretely link the operator aj(A, t) to the wave 
packet theory introduced in this chapter as well consider a kind of field operator 
wholly different from what we have considered up to this point. 

2.6.1 Quantum white noise in paraxial wave packets 

There are several different derivations that lead to bosonic operators a{\t) and 
a^(A,t) that result in calling the object ]imx_^Qa{\,t) quantum white noise. For 
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instance Gardiner and ZoUer use a wide-bandwidth limit where they assume that the 
interaction Hamiltonian Hint is initially specified in the frequency domain and that 
the system couples preferentially frequencies centered at a large transition frequency 
ujq. They then assume that the coupling between the system and the operators a{uj) 
is nearly flat in a frequency band centered at ujq. When this flat coupling band 
is sufficiently wide, the effect is for the system to be interacting with an operator 
representing a white spectrum and is therefore delta correlated [51]. Accardi et al. 
take a different approach by considering a weak-coupling/long-time limit. The weak 
coupling implies that on short "optical" timescales the system field interaction can be 
considered perturbatively but that on longer "mesoscopic" times the aggregate effect 
is nontrivial and the field fiuctuations develop a diffusive characteristic. Though a 
subtle re-scaling of time, a field operator a(A, t) emerges and is delta commuting 
as A — >■ [55]. Rather than fixing ourselves to a specific system-field interaction, 
this work has focused instead on integrating the language of second quantization 
and stochastic processes with a realistic description of classical optics meaning that 
neither model fully fits our needs. Instead we hope to find a description that is not 
tide to a specific model but capture the spirit of each. 

Consider the time ordered exponential 



\ Jo J 
where Hinti^,t), is given in Eq. (2.145). Expanding the exponential to just two 
terms shows us that 



Now consider the creation term, £'joa}(A, s). Physically, this means that locahzed 
in an time interval near time s, create an excitation in the i^^ field mode and while 




(2.146) 




(2.147) 
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you're at it, apply the system operator Eiq. Suppose the joint system was in a pure 
product state \ipi)\0) and that {ipi) happens to be an eigcnstate of EiQ with eigenvalue 
hi- Then the time integral over this term acting on this state gives 

-iEio [ ds dl{X,s)\iJi)\0) = -ihi [ ds dl{X,s)\iJi)\0). (2.148) 
Jo Jo 

This integral looks almost like a smoothed creation operator for a wave-packet with 
temporal envelope function hi{t) — —ihi X[o,t]{s) acting on vacuum. Specifically, Eq. 
(2.107) gives the expression for the one-dimensional smooth wave packet a[g('')(k,i)]. 
Taking the adjoint of that equation results in 

at[g(^)(k,i)] = M r ds f{s)e-'''°'d^[ip^''\t-s)]. (2.149) 
Jo 

The major distinction between this operator and the integral —ihi j^ds d\{\s) is 
the existence of the carrier phase e"^°^. This means that in order to interpret the 
integral —iEiq Jq ds a|(A,s) as creating a system dependent extended single photon 
state, the interaction must be inherently phase modulated at the carrier frequency 
loq. But this is exactly the same statement as Gardiner and Collctt when they assume 
that the system interacts with the bath at a large characteristic frequency. Therefore 
without specifying a detailed interaction model we can say that 

dl{X,s)^e-'-'^^d^[^t\-s)] (2.150) 

and 

a,(A,.)-e+-°^a[<^f)(-^)] (2-151) 

for some system characteristic frequency ooq and smoothing wave packets (f'f^ (k, s) 
and (pj^\k, s). The presence of the time reversal might be a little puzzling at first, 
but this is simply due to the fact that this wave packet was defined with respect to 
a convolution, which always time reverses one of the two functions. One possible 
mapping to include a set of d distinct modes is to assume the model considers a 
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set of d paraxial spatial mode functions ui (xy, z), which satisfy the orthogonality 
relation, J (fxTU*{'XT,z) ■ Uj{'XT,z) = Sij ax- 
To complete this discussion we should identify what the parameter A means in the 
wave packet context. We are able to explicitly compute the unequal time commutator 
from Eq. (2.92) and remembering that for the smoothing kernel ||g|| /^/T = 1. This 
results in 

'a,{X,t), a]{X,t')] = e+-°(*-*') {<p\''\-t), ^f\-t')) 
A is simply a parameter representing the formal limit that as A — )■ 0, cr — > and 



2.6.2 The scattering process 

Up until now, the only kind of field operator we have considered is a creation operator 
associated with a given single particle state g. While these operators are vitally 
important, it does leave out the possibility of a whole other class of field operators. 
Eq. (2.147) expanded the time order exponential for Ut to first order, which included 
the integral J^ds dl{X, s) dj{X, s). We will now show that this operator is quite 
different from the product of two smeared wave packet operators, particularly in the 
limit A ^ 0. 

Consider the exponential vectors e[f (A)] and e[h(A)], f , h e £^(]R+) ^C*^, defined 

as 

|e[f (A)]) = exp (^^ /,(t)al(A,t)^ |0). (2.153) 

If Jq ds aJ(A, s) dj{X, s) where some how equivalent to the product a'''[g]a[g] as A ^ 0, 
for some wave packet g we would have the eigenvalue relationship 

Ihn (e[f(A)]| at[g(A)]a[g(A)] |e[h(A)]> = (f, g) (g, h) (e[f]|e[h]) . (2.154) 
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We can explicitly show that this is not the case. To simplify the notation, we'll define 
the function 



Cij{X,t-s) = [ai{X,t), a]{X,s)], 



which has the property that 



lira Cij{X, t — s) — 5ij 5{t — s). 



Explicit calculation then shows that 



(2.155) 



(2.156) 



e[f(A)] 



ds a|(A, s) aj{X, s) 



e[h(A)]^ = 

Pt POO POO 

ds dsif^{si)c*^{X,t - s) / ds2Cjk{X,s - S2)hk{s2) (e[f (A)]|e[h(A)]) . 
Jo Jo Jo 

(2.157) 

Then in the discontinuous limit, 



lim ( e[f (A)] 



/ ds dl{X, s) dj{X, s) 
Jo 



e[h(A)]^ 

t 



dsf:{s)h,{s) (e[f]|e[h]). (2.158) 







In terms of an inner product on single particle wave vectors this is actually equal to. 



lim ( e[f (A)] 



ds a|(A, s) dj{X, s) 



ih{X)] ) 



= (f , X[o,*] e,e,- • h> (e[f]|e[h]). (2.159) 



This is clearly not an eigenvalue relationship involving a product of wave packets. 
In actuality it is a second quantization of an operator acting on the wave packets 
themselves [45, 52]. The specific operator here is multiplication by the indicator 
function X[o,t] (s) and the dot product into the dyad of basis vectors 6^6^. In the 
language of the Hudson and Parthasarathy formulation of QSDEs, this operator is a 
scattering or conservation process, and is notated as A]^ . While these processes can 
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be derived without reference to a limiting integral, our purposes we simply take this 
to be a definition 

a;-'' = lim /" ds dl{X,s)aj{X,s). (2.160) 
^-^^ Jo 

Appendix C reviews the notation and manipulation of QSDEs in terms of the quan- 
tum Ito differentials dAl, dAp and dA]-' . 



2.6.3 The limiting stochastic propagator 

With a firm connection between the interaction Hamiltonian with scattering terms 
and the wave packet theory, we are now able to express the limiting Hamiltonian in 
terms of an Ito form quantum stochastic differential equation. Appendix C shows 
that the most general quantum stochastic integral usually considered defines a pro- 
cess, 

Ut^Uo+ f dK'iF'J^ [ dA'jFi''+ [ dA{F^J + [ dsFf. (2.161) 
Jo Jo Jo Jo 

In Sec. C.l it also shows what constraint must be placed on the operators F^^ in 
order for Ut to be a unitary process. As any unitary can be written as exp(— iAj) for 
some generator At, instead of working with F^l^ it is more convenient to define the 
operators G"^ so that 

Ff = GfUs. (2.162) 

The unitary constraints written in terms of G^^ is given in Eq. (C.19). 

The bottom line result of the Quantum Wong-Zakai theorem is that the operators 
Gg^ are expressible in terms of the system operators Eai3 defining Hint{X,t) in Eq. 
(2.145) a matrix of constants Kij. As Ea/3 are assumed to be time independent this 
results in G^^^ — Gq^ and so we will omit the time index and demote the superscripts 
to subscripts. While the constants K^j are in general complex the simplest of all cases 
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is when Kij = ^Sij. Not only is this the simplest of cases it is also well motivated for 
our problem and so we will use it here, see Appendix D.2.2. With these simplifica- 
tions, the propagator Uf is expressible as a quantum stochastic Ito integral, which 
solves the recursive QSDE, 

dUt = Gij Ut dh^i + Gio Ut dAi^ + Cqj Ut dAl + Goo Ut dt. (2.163) 

The hmiting coefficients Gap are 

Gap = -lEa/S - \Eai i^^j^^l^ ^30^ (2.164) 

where i and j start from 1 and we defined IE as the matrix of operators E^j. The 
appearance of this matrix in the denominator is precisely due to the possibility of 
having multiple scattering events. A Neumann series is the operator- valued gener- 
ahzation of a geometric series, so that for an operator A, X^^o^" — (1 ~ ^)~^) 
which is well defined whenever 1 — A is invertible. The equivalent Ito coefficients, 
generates a Neumann series of operators where in this case A = -ilM and A" rep- 
resents a quantum scattering between the modes n times. The limiting coefficient 
then involves the i,j component of the operator/matrix inverse {1 + i^JEi)~^. For 
an intuitive physical picture, the coefficients Gap can be interpreted in the following 
way. 

Each coefficient Gap can be roughly thought of a right-to-left acting transforma- 
tion occurring on the system, dependent upon on how it couples though the field. 
The original, direct couplings Eap are still present as shown in the first term in Eq. 
(2.164). In addition to the direct coupling, there are the effects of coupling thought 
the various modes. As an example the second part of the Gjo coefficient shows a 
photon can be created in the i'^^ mode not just by just direct excitation, represented 
by the —iEio term, but also by first exciting j*'* mode, and then scattering any num- 
ber of times and then finally being emitted into the i*'*. The same goes for Goo and 
Gij except these either leave the field unchanged or transfer a quantum from mode 
j to mode i. 
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2.6.4 A simple ID example 

Nearly the simplest of all nontrivial examples of the quantum Wong-Zakai theorem 
is when d = 1, and 

En = 0, E,o = tV7D, Eoi = -t^^D\ and Eoo = Hsys (2.165) 

where D and Hgys are system operators. In other words, these are the coefficients 
for a total Hamiltonian 

Hi^t{X,t) = n {t^ D a\X,t) - DU{X,t) + H,y,) . (2.166) 

This is an extremely common model in quantum optics where an atomic dipole 
operator D couples with the rate 7 to a quantized quasi-monochromatic electric field, 
with "white noise" creation operator a''"(A,t). Hgyg is the remaining system operator 
which includes any residual detuning of the field mode from the system transition 
frequency or any externally applied controls. The quantum Wong-Zakai theorem 
states that this pre-limit Hamiltonian generates a propagator with the coefficients 

Gii = 0, Gio^V^D, Goi^-^D\ and Goo ^ -iH^y^-^^ D^D. {2.167) 

This results in the propagator Ut satisfying the QSDE 

dUt = (^^/^DdAl - ^D'^ dAt - iHsys dt - ^7 D dt^ Uf (2.168) 

Next section we will apply the quantum Wong-Zakai theorem to the much more 
interesting case of the Faraday interaction where d = 2 and Eij ^ 0. 

2.7 The Faraday Interaction 

The Faraday interaction is physically based on an optical field propagating in a 
polarizable medium. In classical optics it is used as a magneto-optical effect, where 
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the polarization of a linearly polarized probe is rotated by an amount proportional 
to the component of the magnetic field parallel to the direction of propagation. 
At a macroscopic level, it is modeled terms of the energy shift of a polarizable 
particle induced by an oscillating electric field. Such an energy shift can be easily 
implemented and precisely controlled by applying a quasi-monochromatic laser to 
a rarefied monatomic gas. The laser's effect on a single atom can be considered a 
perturbation to its atomic ground states, if the laser drive is in a "low saturation" 
regime. Specifically the laser's carrier frequency must be close to, but significantly 
off resonance from, a ground state transition. Furthermore the intensity must be 
small enough so that the total number of excited atoms will be negligibly small. 
Not surprisingly, the details of the ground state atomic structure effects both the 
magnitude and direction of the induced polarization and often leads to effects beyond 
a simple linear rotation of the probe polarization. The derivation of this kind of 
interaction is elegantly presented by Dcutsch and Jcsscn in [5]. Here we will consider 
the simplest of all settings where the atomic ground state is given by a spin 1/2 
particle. Such a ground state is experimentally realizably, if the atom has a single 
valance electron and negligible hyperfine structure, or if two valence electrons form 
a spin singlet ground state and the nucleus has a total spin I — 1/2. 

In either the classical or quantum mechanical setting, the polarizability Hamil- 
tonian for an atom located at position is given by 



where W is the polarizability tensor. Quantum mechanically, a is an operator 
acting solely on the atomic ground states. It is worth noting that in any real system 
there will be additional decoherence due to spontaneous emission, which in this 
treatment we will ignore. Shortly, we will note that the strength of the coherent 
interaction is proportional to F/A where F is the excited state decay rate and A is 
the probe detuning from the atomic resonance. One can additionally show that the 
incoherent photon scattering is proportional to F/A^. When F/A is small, F/A^ is 




(2.169) 
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smaller and so the decoherence is often ignored. 

The connection to the quantum Wong-Zakai theorem is that Hpoiar is quadratic 
in the field operators. As E''^-'(ra) oc and E^'\ra) oc a, the components of 
will be identifiable with the operators Eij. By ignoring spontaneous emission we are 
also able to restrict our attention to a single quasi-monochromatic paraxial mode. 
While in principle the atom couples to any electric field at its location, including the 
quantized vacuum, we will be applying a coherent displacement in a definite mode 
u^'^\xt, z), with an envelope function f{t). As we have seen this envelope function 
is expressible in terms a convolution with the smoothing function (p^'^\t), sec Sec 
2.6.1, and so the relevant field operators we will be considering are d{X,t) and its 
adjoint. In other words, we are simply using this as an example for applying all of 
the theoretical machinery developed in this chapter. 

Before finally writing down Hint{X,t), we will make one more extremely useful 
but only marginally justifiable approximation. Here we assume that the spatial 
distribution of the atoms will be irrelevant and all atoms in the ensemble can be 
treated as existing at the same location in space. Prom the point of view of the 
slowly varying envelope, this is a reasonable assumption if the dimension of the gas 
along the direction of propagation is on the order of ca. If the longitudinal extent of 
the atoms becomes significant then Hint would have to treat atoms at the beginning 
of the gas differently from the atoms at the end. In addition to having a spatial- 
temporal dependence, any realistic paraxial beam will have some intensity variation 
in both xt and z. If we take u'^~^\x.t, z) to be a standard Hermite-Gaussian beam, 
the transverse and longitudinal intensity can be treated as approximately constant if 
the gas has a transverse area that is small when compared to the beams characteristic 
area ut- However, if a beam of a fixed input power has a large transverse area then 
it will have a relatively low intensity at any give point relative to a beam with a 
smaller (Tt- Ultimately this means that the more uniform the probe is, the weaker 
the over all interaction will be. 
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With all of the above caveats and assumptions the Faraday interaction Hamilto- 
nian is 

HintiK t) = h^J, (a^A, t)ar{X, t) - aJ(A, t)ai{X, t)) (2.170) 

with the operators and constants defined though the following. For a spin 1/2 ground 
state, the single atom polarizability a is diagonal in the circular polarization basis, 

and e;. Up to an irreverent global energy shift ^ cx az{e*er — ej'e;) where 
is the pauli z operator and the proportionality constant depends upon the specifics 
of the atomic physics. By assuming that all of the atoms exist at the same location 
in space, computing the Hamiltonian for the whole ensemble reduces to computing 
a sum over the individual polarizabilities which further reduces to summing over all 
of the operators. It is well known that with N spin 1/2 particles, a collective 
pseudo-spin J can be defined whose components {i = x, y, z) are 

N 

J, = Y.\at\ (2.171) 

n=l 

The details of finding the proper proportionality to express the total interaction as 
Eq. (2.170) is given in [5] but the dimensionless constant xo has an exceedingly 
simple form with, 

where ctq is the resonant scattering cross-section for the given transition. This con- 
stant can be viewed as giving the probability that a single atom absorbs and remits a 
photon into the paraxial beam. In order to make all the above approximations valid 

{e.g. assuming that the intensity is near constant and that spontaneous emission is 
negligible) requires that both ctq <C (Tt as well as F <^ A, meaning that Xo ^ 1- 

When proving that the convergence of U{X,t) in the sense of matrix elements 
limA^-o (e[f(A)]| U{X,t) |e[h(A)]), Gough used the useful relationship that 

lim (e[f (A)] I U(X, t) |e[h(A)]) = lim (0 t/(A, t) 0) (2.173) 
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where U is the propagator, except with the replacements 

di(X,t) ^di(X,t) + hi{t) and dl(X,t)^dl(X,t) + f*(t). (2.174) 

As we are applying this limit when the field has a coherent displacement, it is suf- 
ficient to work with these displaced versions and pretend that the field is in the 
vacuum. The heart of Faraday interaction is the rotation of a linearly polarized 
input and so we will assume that our displacement is linearly polarized and that 

the equivalent amplitude function f (t) G £^ Cg) C'^ represents the displacement in the 
electric field. With these two constraints we have that 

/.W = /;W = 7s/oW (2.175) 

where fo{t) is real-valued. Typically experiments involving the faraday interaction, 
the driving displacement is operated in switched on to a constant value for the 



duration of an experiment. In this case max(/o) = \/Nl/t where Nl is the average 
number of photons in a pulse of duration r. 

In the case of this displacement, we have the effective vacuum Hamiltonian 

- {dliX,t) + f;it)){diiX,t) + flit))). (2.176) 

It is useful to define the function, 

<t)^{fMt)y. (2.177) 

When fo{t) is held at a constant level we have a characteristic rate k = (xo/3)^ Nl/t. 
From this definition we can expand out the interaction 

Hint{x,t)^hfj, (dl{x,t)dr{x,t) - d\{x,t)di{x,t)) 

^ ^ (2.178) 

+ inJ^j,{dl{x,t) + dl{x,t)-dr{x,t)-di{x,t)). 



Sec. 2.6 discusses the operators Eij as representing the scattering of field quanta 
in a system dependent way. In addition, the limiting coefficients show that when 
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Eij takes on relatively large values, there is a possibility for multiple scattering 
events. In the practical approximations that resulted in xo ^ 1, the probability 
for multiple scattering is relatively small, unless takes on obscenely large val- 
ues. Note that when fo{t) ^ 1, there can still be a significant interaction, as this 
means that the second linear term dominates the interaction. Also note that we have 
{ar{X,t) + di{X,t)) — ah{X,t), i.e. an annihilation operator for horizontal polar- 
ization. Dropping quadratic terms in favor of the terms with a large displacement, 
we now have an approximately linear interaction in a single horizontally polarized 
mode. 

Hint{X,t)^ih^/^J,{a[{X,t)-ah{X,t)). (2.179) 

Prom the 1-D example this means that we have Gio = •>/ K(t) Jz and so if we add 
a time-dependent system control Hamiltonian Hc{t) to this expression we have a 
propagator 

dUt = (^y/W) Jz dA^"^ - J, dA^ - \K{t) Jl dt - iH^{t) dt^ Uf (2.180) 

This is the propagator that we will be considering in Chaps. 4 and 5. 

It is worth noting that the Faraday interaction has been applied to several differ- 
ent continuous measurement models in the QSDE formalism with varying levels of 
initial assumptions [56, 57]. In [56], the free field was assumed to be well modeled 
by a QSDE and a Faraday like interaction was derived via adiabatically eliminat- 
ing an excited state as well as an artificial cavity mode which left many questions 
unanswered as the derivation was made strictly through a single mode picture and 
did not address the fundamental two mode structure of the Faraday interaction. 
In contrast [57] considered a scattering interaction, however there they simply took 
the fundamental scattering interaction before the displacement and substituted the 
scattering processes dh-Y and dA" for the white noise operators. What that model 
failed to consider was the effect of normally ordering the field operators in obtaining 
the proper Ito correction. In the language of the quantum Wong-Zakai theorem, the 
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propagator initially considered in [57] should have been interpreted as a quantum 
Stratonovich equation and not a Quantum Ito equation. 



2.7.1 The quadratic Faraday interaction 



It would be a shame to discuss the full solution to the quantum Wong-Zakai theorem 
and not give an example that retains the scattering interaction. The Faraday inter- 
action is a prime candidate for this, in a world where we have a weak drive fo{t) ~ 1 
but it is possible to see some kind of effect. From Eq. (2.178) we can identify 



E — —Ell — ^ J 

J-^rr — ■'-^11 — 3 




(2.181) 

and 



We can substitute these operators into the coefficients Gap in Eq. (2.164). After 
some algebraic simplifications we find that 
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App. C reviews the usual formulation of the propagator dUt, in terms of the operators 
Sij, Li and H. Some more simple algebra shows that 



„ _ ot _ ^ ^1)°'^^ 
^rl — ^Ir — 0, 



(2.183) 



H ^0. 



In order for dUt to be a unitary process, Sij must form a unitary matrix of operators, 
i.e. SjjSjk = SijSjf, = 5ik, which is clearly satisfied in this case. So finally the 
Faraday interaction generates the propagator Uf, which solves the QSDE 

dUt = ({Srr-l) dAY+{Sl-l) dAf+Lr {dA^^ -dAl)+Ll {dA^^ -dA[)~LlLr^Ut 

(2.184) 

with the initial value Uq — 1. In the small xo limit we were able to write the 
propagator in terms the linear polarized field operators A'l and A^^ . This is not 
the case here as the right and left polarization states have different atomic coupling 
operators. This might result in creating some system dependent elipticity to the 
probe laser, however more analysis is clearly needed. 

The power of writing the propagator in terms of the {S, L, H) parameters is 
that a large number of results have already been computed for general coefficients 
which can simply be applied here. As an example suppose we wish to compute the 
unconditioned master equation of the atomic system assuming that the displaced field 
is in vacuum, i.e. other than the coherent drive laser. Then the master equation 
is given in Lindblad form with jump operators Lj. and L/. Specifically the system 
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density operator p{t) is the solution to 

^ LrpLl- \LlLr p-\p LlLr + U p - \l\Li p-\p L\Li 

Jz ^ Jz ■^z n n -^z 



(2.185) 

Here we can see yet again that when xo — >^ we recover the standard dissipative 
master equation with measurement operator K{t) J^. Also note that when the system 
is prepared in either an eigenstate or a mixture of eigenstates of Jz then it does not 
evolve in time. 
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Chapter 3 

Classical and Quantum Probability 
Theory 

This chapter serves two purposes. The first and primary intention is to present a 
number of known resuhs from classical and quantum probability theory, which will 
serve as a foundation for the novel work in later chapters. Those results rely on a 
detailed knowledge of the (classical) statistical properties of the Wiener process and 
so we review them here. Additionally, we need to know how to extract a classical 
Wiener process from a fundamentally quantum system. The procedure of identifying 
a stochastic process embedded in a quantum system is one useful application of a 
more general mapping between quantum systems and classical probability theory. 
The second purpose of this chapter is to emphasize the power of this technique and to 
discuss how the language of classical probability theory can be used to identify certain 
symmetries that might exist in a quantum system. In order to do this coherently, 
we also review some of the basic elements of classical probability theory. 

By working in the language of classical probabihty theory, the tools of nearly 80 
years of classical mathematical analysis can be applied to quantum problems, with 
one important example being a continuous-time quantum filter. We will not rederive 
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it here, merely discuss its origins, limitations and various formulations. A particulary 
important form is the conditional master equation, an equation that is in some 
sense semiclassical and can be viewed as being generated by the quantum-to-classical 
mapping. Here we use the term semiclassical in the sense that the measurement 
record is modeled as a real valued classical stochastic process whose statistics are 
given by a quantum system expectation (see Sec. 3.4). Chaps. 4 and 5 work 
exclusively with this equation, albeit in three different variations. The final topic of 
this chapter is to show how these various forms are derived. 



3.1 Classical Probability Theory 

A physics Ph.D. program does not generally include a course in measure theory or 
axiomatic probability theory. Most physics problems only consider a handful of dis- 
crete or real- valued random variables and so applying a full measure theoretic context 
is unnecessary. However in some instances, working only with a probability density 
function becomes either intractable or conceptually problematic. One example is 
when one is attempting to understand the behavior of a random function defined 
over continuous time. In principle, this requires describing an uncountable number 
of random variables, one for each possible time, where the density function at a given 
time could be highly correlated with past (and maybe even future) times. 

Furthermore, when adding the possibility of statistical inference to the picture, 
defining individual density functions becomes even more convoluted. Consider trying 
to estimate the history of the random variable Xt based upon a continuous observation 
of a nonlinear function of a;, e.g. f{xt) = sin(a;t). Writing down a joint and marginal 
density functions for xt and f{xt) is not particularly straightforward, as they are 
clearly distinct objects but are hardly independent. In the long run, a much more 
efficient way of doing business is to decouple the notions of random events and their 
associated probabilities from the specifics of any one random variable. By finding a 
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way to associate Xt, f{xt), and maybe even a third random variable y to the same 
underlying structure of events, we can then calculate the probability associated with 
those events, independent of the specifics of x, y or f{x). The way this decoupling 
is made is by invoking some of the structure found in measure theory. 

An axiomatized probability model contains three elements, usually written as the 
triple J^, P), with fl being a sample space, J-' a a-algebra of events, and P is a 
probability measure over those events [22, 53, 58]. We will now discuss each element 
including specific examples. Ultimately, we are interested in describing diffusive 
measurements and so we will focus on the example of Brownian motion. Brownian 
motion is the canonical example for a system experiencing unforced diffusion and the 
Wiener process is the most widely used mathematical model for such a system. Chap. 
2 already encountered an instance of a Wiener process, in the vacuum statistics of 
the quadrature At + AI. 

The first element of a probability space, Q, is called the sample space and describes 
the set of all possible outcomes of the model. In a system with a discrete number of 
outcomes, a flip of a coin or a roll of a die, then fl is simply the set of all possible 
outcomes. For the coin = {heads, tails} and the die Q — {1,2,3,4,5,6}. In 
addition to these discrete examples, the sample space could also be uncountably 
infinite. For a Brownian particle, moving in d dimensions. The sample space is the 
space of all possible trajectories. As a particle's trajectory must be a continuous real 
valued function, fl is then then the set of all continuous functions of time [58] 

fl = {uj{t) : R+ W^, u continuous} . (3.1) 

The next element of the probability model, J-', is a a-algcbra over the sample 
space. This represents any "sensible" question we can ask about the various out- 
comes. Each object in the algebra represents such a question and is called an event. 
In this formalism, probabilities are computed not from the individual outcomes in 
fl, but instead from the events in T. The reason for this distinction is to exclude 
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pathological cases that arise when working with uncountable sets and is the same 
reason measure theory was developed. When the sample space Q is uncountably 
infinite, one can find highly pathological sets that can be used to obtain paradoxical 
results. For instance, by choosing just 5 disjoint subsets from the unit ball, one can 
construct, simply through translations and rotations, two independent and identical 
copies of that ball [59] . It would be problematic for a probability model to consider 
these kinds of sets, as one could then double the probability for picking a point in 
the unit ball simply by doubling the ball. Identifying the elements of J-' with sensible 
questions means that we are excluding these kinds of pathologies. 

In the discrete case the sensible questions are things like, "Did the die land 
with an even number?", "Did it land showing the number 6?", or even "Did it 
land showing any number 1 though 6?". Mathematically, these questions represent 
sets of the underlying outcomes. These correspond to the sets {2,4,6}, {6}, and 
{1, 2, 3, 4, 5, 6} = respectively. The cr-algebra T is the set of these sets, representing 
any possible question -event- we can ask about the system. For a finite and discrete 
number of outcomes, T is usually the power set, in that it is the set of all possible 
sets one can make out of Q. Operationally speaking, a cr-algebra has the following 
definition [53]. A cr-algebra ^ is a collection of sets of fl satisfying the following 
three properties^, 

1. If a countable number of sets {^n}„e]N ^ ^ then U„>1„ G T. 

2. If ^4 is a set in than its complement, A*^, is also in 

3. J-" must contain the space Q, and therefore by the second property, its comple- 
ment the empty set {0}. 

For the case of Brownian motion, T is the cr-algebra of all "cylinder sets" , which 
are defined in the following way [58]. In real valued random variables probabilities 
^The "cr" in cr-algebra is to mean "countable" [58]. 



Chapter 3. Classical and Quantum Probability Theory 



93 



are given in terms of intervals. The probability that a random variable x, with 
probability density p{x), has a value in the interval [a,b] is given by the integral 
J^dxp{x). Here the event is the interval [a,b] and is an element of the Borel a- 
algehra, B. This is essentially the set of all intervals, open and closed over, the real 
line. 

However, at any given time, a d-dimensional Brownian motion will take on values 
in M.^. In order to ask if a trajectory landed in some interval, /, we must also specify 
an associated time, t, for that measurement. A basic cylinder set is then specified 
by both a time and an interval. The actual set, C{t;I), is the set of all Brownian 
trajectories that are in I at time t, 

C{t; 7) = {a; e Q : uj(t) E 1} . (3.2) 

A trivial example is the set C(i,R'^) = fi, i.e. all continuous trajectories will have a 
value in R'' at any time t. A nontrivial example in one dimension is to ask for the 
set of all trajectories that are between a — -10 //m and 6 = 5 //m at time t — 5 ms. 

In addition, questions that involve multiple times are also sensible. It is per- 
fectly reasonable to ask, "what one-dimensional trajectories are in Ii — (ai,6i) 
at time ti and in I2 — [02,^2) at time t2 > ii?" This is also a cyhnder set, 
C(ti,t2; h, h)- An image that might be helpful is to imagine that the cylinder set 

C(ti,t2, ■ ■ - tn', h, h, ■ ■ ■ In) defines the set of trajectories that successfully navigates 
the "slalom" defined by these intervals and these times. 

The (j-algebra we will use for analyzing Brownian motion is the cr-algebra gener- 
ated by the cylinder sets defined by all countable sequences of times and all open sets 
of M'^ at those times [58] . Note that the issues of discussing a uncountably infinite 
number of random variables is avoided by defining the cylinder sets for a countable 
number of times. In fact asking questions about an uncountable number of events 
is ultimately identified as "unreasonable" as it allows for the introduction of patho- 
logical possibilities. Here we are only interested in describing the continuous sample 
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paths of a Brownian particle and means that we can safely consider a countable 
number of events, e.g. times defined by a sequence of rational numbers. 

The final element of a probabihty space is the probabihty measure, P. It defines 
the probability for observing the events in T. Mathematically P is a function that 

takes sets of Q, (elements of and maps them to real numbers between zero and 
one, P : ^ ^ [0, 1]. In order for a valid measure to be a probability measure we 
must have: 

1. The probability of something happening be one, P(0) = 1, and the probability 
of nothing happening be zero, P(0) = 0. 

2. The probability of the union of a countable number of disjoint events in J-" 
must be additive, 

P(U„A„) = J2 if A„ n = for An, A„ e J- and n ^ m. (3.3) 

n 

The requirement that a probability measure be countably additive is simply a state- 
ment that if A is independent of B then the probability to observe or is the 
sum of the two probabilities. 

Shortly we will discuss what the probability of observing a given cylindrical set, 
if the trajectories in those sets represent unforced Brownian motion. 

3.1.1 Stochastic processes and random variables 

From a well constructed probability space we now need to sec how random variables 
fit into the measure theoretic context. Much more can be said on this topic than we 
can include here, so an interested reader is encouraged to consult [22, 53, 58, 60]. 
Chap. 5 requires a reasonable understanding of the statistical properties of a one- 
dimensional Brownian motion, the Wiener process, and so we will focus on that 
example here. 
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Abstractly, a random variable / is a function that maps elements of fl to another 
space, usually the real numbers. Placing a $50 bet that a coin toss will land heads 
is an example of a random variable. Another example of a random variable the 
indicator function X^(a;) for any event A & J^. Chap. 2 already found many uses for 
an indicator function, which in a probabilistic context, is a random variable defined 
as 

f 1 iioj eA 

Xa{co) = { . (3.4) 

[ if CO ^ A 

Such a random variable is deceptively simple but is also extremely useful. One of 
its primary uses is that they relate set operations in T to algebraic operations on 
random variables. It is easy to show that the random variable x{u!) 
is equal to 1 whenever u e AU B. Also the random variable y{u!) =Xa{'^) ^^(a;) is 
equal to 1 only when u) G An B, meaning 

Xaub{uj) ^Xa{co)+ Xb{uj) (3.5) 
XAHBico) =Xa{cu) XBicu). (3.6) 

For the case of ID diffusion, one of the most important random variable is pa- 
rameterized by time and simply returns the value of trajectory at that time. For all 
times t > 0, we define the function xt : fl ^ M. such that 

xt{u;)^uj{t). (3.7) 

This definition might seem a bit pedantic, but note that the trivial random variable 
y{uj) = a; is not real-valued, uj describes the entire trajectory not just at any one 
specific time. From Xt{uj) a whole host of other random variables can be defined 
through functional composition. A man could place a $50 bet on whether or not a 
diffusive particle will be greater than -|-5 fim from its starting point by time t — 1 
ms. That bet is the composition b{xims{<-^)) where b maps real number to ±50. 

The random variable Xf in Eq. (3.7) only gives a snapshot of a trajectory at that 
time. In order to describe the trajectory dynamically in time as a random variable. 
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there is the notion of a stochastic process. Most generally, a stochastic process is 
a family of random variables {xt}^^^ indexed by some parameter t, almost always 
representing time. Typically we will take time to start at and either let it continue 
off towards infinity or, when convenient, stop at some finite time. When discussing 
the concept of the process we will use the notation {xtj^^Q and Xt is the random 
variable given at that time. Before discussing a couple important types of processes, 
we should know how to compute the probability for a random variable to evaluate 
to a range of values. 

The previous section showed that a probability measure P acts on elements of J-" 
and returns probabilities. To compute the probability of a $50 dollar bet b to win, 
we need to identify the set of events that the function b : fl ^ ±50 evaluates to 
50. Because 6 is a function acting on ft, we can also consider its inverse map b~^. 
If a given random variable can take on a continuum of values, we can still run into 
pesky problems of having uncountable numbers of things. The solution to this is 
to again only consider sensible sets of outcomes for any given random variable. For 
the random variable Xt from Eq. (3.7) we will have to take its inverse map x^^ to 
act only on elements of the Borel cr-algebra, B. When we ask for probabilities of 
observing certain values of a random variable x, we must ask for the probability of 
observing sets or intervals in the range of x. 

For every "reasonable" interval that x maps to, there must be a corresponding 
element A e in order for us to be able to calculate the probability of that under- 
lying event. Such a random variable is called measurable. If a random variable x is 
not measurable, then there is little we can say about it when its outcomes lead to 
unreasonable questions. In other words, a nonmeasurable random variable has an 
inverse that generates sets not in Faced with this possibility we can either ignore 
such questions and pray they never occur or redefine the probability space in order 
to make these sets measurable. A nontrivial example of this problem is suppose 
we had a random variable i/t that returned the value 1 whenever the sample path 
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exhibited a discontinuous jump in the time interval [0, t) and zero otherwise. The 
question, "what is the probabihty of yt returning 1?", corresponds to P(?/(~^(l)). If 
our probabihty space is constructed only of continuous functions, then we technically 
can't answer this question as the pre-image y^^{l) ask for the set of functions that 
have a discontinuity for times < s < t, which is not an element of T. 

By defining random variables as measurable functions, wc can easily relate the 
statistics of multiple random variables to each other though their inverse maps. 
Consider the stochastic process {xtj^^Q defined by Eq. (3.7). Then xt^ and Xt^ are 
two random variables taking on values in the real number line. Suppose we wish to 
calculate the probability of observing Xf^ in the interval (a, b) and Xt2 in the interval 
{c,d]. Individually, we have 

x;^'{(a,b))^C{t,;{a,b)) (3.8) 

and 

xr/((c,d])=C(i2;(c,d]) e J- (3.9) 

The joint probability of these two events is simply the probability of the intersection 
of these two sets, 

P(c(ti; {a,b))nC{t2; (c,rf])) =p(c(ti,t2; (a, 6), (c,ci])) (3.10) 



3.1.2 Expectation values, the conditional expectation, and 
measurability 

The most fundamental operation one performs with random variables is computing 
their expectation values. If the random variable z{u!) takes on a finite number of val- 
ues, {z^*^ : i — 1, . . . ,n^, then calculating the expectation value for z is no different 
than in the nonmeasure theoretic context 

n 

E(^) = ^z« P(^ = 2(*)). (3.11) 
1=1 
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The expectation value of z is the average of all its outcomes, weighted by how likely 
they arc to occur. Note that writing P (2; = z^'''^'^ is shorthand for finding the event, 
Ai = z-^ (z^^) with 

P{z = = P{Ai) = P{{ujeQ : z{uj) = z^'^}). (3.12) 

In addition to these simple random variables, we need to formulate expectation 
values for random variables that can take a continuum of values. This is done 
by defining a measure theoretic version of a standard Riemann integral, called the 
Lebesgue integral. One path for this construction is to make an approximation for 
Xt that takes on a finite number of values. The expectation value of such a discrete 
approximation is easily computed though Eq. (3.11). Then by taking a suitable 
limit where the number of values become continuous we can calculate the proper 
expectation value. At this point this procedure is a bit vague, but as we have not 
specified a measure for Brownian motion, it is difficult to be more specific. When 
discussing the Wiener process, we will be able to be more clear. 

Probability theory gets a lot more lively when instead of considering simple expec- 
tation values we consider conditional quantities. When working with simple random 
variables finding a conditional expectation values is no harder in a probability space 
than in a standard context. No matter how sophisticated the framework, Bayes' rule 
still applies, in that for two events Ai and A2 we have 

P(Ain/l2) =P(^l|^2)P(^2). (3.13) 

Whenever P{A2) 7^ we can invert to find the conditional probabihty of Ai given 

We emphasize that we are calculating the probability of an event Ai occurring, 
conditional on the event A2. While A2 could correspond to the pre- image of a 
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single value of a simple random variable, it could also correspond to another random 
variable taking on a range of values or even the intersection between the outcomes 
of two random variables. All of these different possibilities correspond to the same 
underlying event and thus carry the same information. 

Turning this conditional probability into a conditional expectation value is simply 
a matter of weighting the outcomes of one random variable by the conditional prob- 
abilities. We will illustrate this in a quick example. Consider the two simple random 
z{u!) and y{uj) with values {z^^\ . . . , z^")} and {y'^^\ . . . For each variable 

and each outcome find the corresponding events, Ai — z'~^{z^'^'>) and Bj — y~^{y^^). 
Then the conditional expectation value of z given that y = y'^^^ is 

E {z\y = = ±z^^ P (. = = ^±z^^ ^^^7^- (3-15) 

i=l i=l ^ ^' 

If one is having to compute this conditional expectation value by hand, hopefully the 
events Ai and Bj are relatively simple and that computing the probability of their 
union is relatively straight forward. But even if this is not the case, by adding the 

structure that events are sets and probabilities are measures on events, computing 
conditional quantities does not require defining a new probability structure like a 
joint density function for each random variable we want to consider. 

Sometimes it is convenient to write the conditional expectation not directly in 
terms of the events Ai and Bi, but instead in terms of a regular expectation value and 
an indicator function. This method will be useful when both z and y are not simple 
random variables, also when we want to abstract away y and instead think about 
conditioning on just some abstract event B. A nice property of indicator functions 
is that because they only the value 1 on a single event, we can write 

E(X^J = P(A) (3.16) 
and in particular, 

P {A, n 5,) = E(X^,nB,) = H^aM). (3.17) 
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As the expectation value is a linear operation we have that 



i=l 



n 



E(Xb.) E(Xb.) E(Xb.) 



(3.18) 



The last equality here is particulary useful as it holds even when z isn't a simple ran- 
dom variable. When continuous-time and continuous-valued random variables are 
involved, it should not be surprising that explicitly computing conditional quanti- 
ties by finding the underlying sets and computing their union is often impracticable. 



tions, often an explicit computation is still impractical. Instead, taking an indirect 
root is often fruitful and one of the prime tools for doing so is what is called the 
conditional expectation. 

Moving from a conditional expectation value, to a conditional expectation, is 
only one step more complicated. If one is prepared for computing the conditional 
expectation value for every outcome of y, i.e. having computed all of the sets Bj and 
know which have zero probability, you can write down a random variable that in some 
sense computes all of the conditional expectation values at once. The conditional 
expectation of 2; on y written as Ei{z\y), is a random variable that takes on the 
value E (^z\y = y^^^^ whenever y = y^^\ This works through the following. For every 
outcome y^^^ we have an underlying event Bj. Whenever P(-Bj) 7^ we can define 
the random variable. 



Whenever P(-Bj) = we can give the conditional expectation value any value we 
wish, safe in the knowledge that the probability for obtaining such an arbitrary values 
is zero. A lot can be said for this object, but one of the most important is that it can 
be viewed as a reasonable estimate for z given information about y. More specifically 
suppose you wanted to find a least-mean-squared estimate for what value z would 
return when the outcome you receive, u), results in y{u) = y^'^\ Generally speaking. 



While working with the indicator function Xb^ niakes some substantial simplifica- 




(3.19) 



3 
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there are many different a; for any value of y^^^ also the set of uj that gives this value, 
may return multiple values for z. It turns out that if you want to make any estimate 
for any random variable, z, conditioned on the events given by another, y, then the 
estimate for z must have the form 

m 

5(w) = J]a,- XB,(a;) (3.20) 
i=i 

where Bj arc the events generated by the various outcomes for y and aj are constants. 
It should not be too surprising that by finding the constants that correspond to the 
least-mean-square estimate for z given y turn out to be the conditional expectation 
values aj — M (^z\y — y'^^^). 

Any random variable that can be written as Eq. (3.20) is called a y-measurable 
random variable. Note that y itself and almost any function of y can written like 
this. The sets Bj can be used to construct a cr-algebra by taking all (countable) 
unions and complements, then the random variable z is measurable with respect 
to that cr- algebra. The u-algebra formed by these sets, written as y or sometimes 
(t{|/}, is called the sigma algebra generated by y. It is not difficult to show that if 
you have multiple random variables x, y, etc. you can form a a-algebra, a {x, y, . . .}, 
generated by all of those as well simply by taking unions and complements of all of 
their various events. 

All of the above concepts, joint probabilities, the conditional expectation, gener- 
ating (T-algebras from random variables, etc., can be extended to continuous random 
variables by taking appropriate limits. More often than not, one tries to avoid taking 
an actual limit, and instead looks for a random variable that is y-measurable and 
satisfies the basic properties of the condition expectation. For general reference, here 
are some of those properties: 



1. The conditional expectation is linear, 

M{ax + /3z\y) = aE {x\y) + /SIE {z\y) 



(3.21) 
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for constants a and /3. 



2. The conditional expectation is consistent with usual expectation values, 

E(E(a;|y) ) = E(a;). (3.22) 



3. The conditional expectation of any y-measurable random variable, x, is itself 
M{x\y)^x. (3.23) 



4. If z and y are independent than the conditional expectation is just the expec- 
tation value for z (times the "identity" ) 

M(z\y)^M{z) Xn(uj). (3.24) 



5. For any y-measurablc random variable y', the conditional expectation value 
satisfies the property that 

E{E{z\y)y')^E{zy'). (3.25) 



while this last property could be inferred from the second and third, we include it 
here because it is often what is used to "guess" what the conditional expectation is, 
without going though a bare bones construction. 

Much, much more can be said about the conditional expectation. The deriva- 
tion of classical and quantum filtering theory is based upon computing a conditional 
expectation of some unobserved processed, {xt} based upon measurements of a cor- 
related process {yt}- Unfortunately, we will be unable to go into the detail here 
but interested reader is encouraged to seek out a number of good references on the 
subject, some of which are [22, 25, 37, 53]. 
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3.1.3 Special processes - time-adaption and martingales 

Having discussed random variables and stochastic processes in terms of a classical 
probability space {Q,J-',P), we now need to introduce a couple of important and 
useful processes. Chap. 2 already introduced the concept of a time-adapted process. 
An stochastic process {xt}^>Q is time-adapted when it depends only on events defined 
in the present or past and not on the future. Having introduced the concept of a 
measurable function and a u-algebra over the cylindrical sets C(ti,t2, ■ ■ ■ ; h, h, ■ ■ ■) 
we can easily give a precise meaning to time-adapted process. A stochastic processes 
{^t}t>o time-adapted when each random variable Xt is measurable with respect 
to the (7-algebra, Tt, generated from the cylindrical sets C{ti, . . . ,tn < t; Ii, . . . , In)- 
(Sometimes a more general definition avoids using these specific cylinders and simply 
uses an indexed sequence of cr-algebras J^o C J^s C J^t C called a filtration [53] . A 
process adapted to this filtration is called J^j-adapted.) In the context of statistical 
estimation, working with time-adapted processes is essential, as these are the pro- 
cesses that are independent of any future events. Within the bounds of time-adapted 
processes there are an additional type of stochastic processes that have special and 
simplifying characteristics in terms of their conditional statistics, called martingales. 

A martingale is an important kind of stochastic process which plays a crucial role 
in classical probability theory. More importantly for our purposes, they will play a 
crucial role in significantly simplifying a quantum conditional master equation as we 
will see in Sec. 3.4.1. They are used to represent fair betting games where no amount 
of past information is helpful in predicting future events. The defining property is 
that the conditional expectation of any future value of the process is simply given by 
its current value. You expect to leave a fair casino with the same amount of money 
as you had when you entered^. In essence a martingale is a random process where 
the conditional mean of any future increment is zero, [22, 53, 60]. To illustrate this 
property, consider taking a fair coin and flip it N times. A typical sequence uj may 

^In contrast to a real casino. 
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be something like, 



u;^{H,T,H, H, H, H, T, T, H,...}. 



(3.26) 



For any sequence u we can create a random variable Xn which is equal to the number 
of heads minus the number of tails seen in the first n flips. So that for this sequence 



In the case of a fair coin, Xn is a martingale. 

To see why, note that in each flip there is equal probability of the coin landing 
heads or landing tails. So that for any n we have the expectation value. 



In this speciflc realization, after the flrst four flips X4 is not 0, but is in fact 2. 
However, because any future flip are independent of the past we should not expect 
to see any more heads than tails. This means that conditioned upon the flrst four 
outcomes we should not expect for a;„>4 to be 0, but instead it should average around 
2. In other words 

E(xm — Xn\ {xi, X2 ■ ■ ■ Xn}) — for all m >n. (3.29) 
This is the fundamental property of a martingale which is usually written as, 

E(xto| {xi, X2 ■ ■ ■ Xn}) — Xn for all m>n. (3.30) 

Now imagine that that the coin was not in fact fair. Say that the probability 
for heads was Ph — 2/5 and the probability for tails was Pt — 3/5. Then in this 
case Xn is not a martingale, as we would instead expect Xn to trend negative. Or 
in other words E(a;m| {xi^X2 ■ ■ -Xn}) < Xn for m > n. But while Xn may not be 
a martingale, it is sometimes possible to construct from x another random variable 
that is a martingale. This kind of process is called a semi-martingale and deflnes the 
class of processes that are capable of being used in an Ito integral. The fact that 
this is possible will play a crucial step in flnding a maximum likelihood estimate in 
Chap. 5. 



x = {l, 0, 1, 2, 3, 4, 3, 2, 3,...}. 



(3.27) 



E(x„) = 0. 



(3.28) 
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3.1.4 The Wiener process 

One of the most important processes one can consider is the Wiener process, a 
mathematical model for Brownian motion named for Norbert Wiener. In addition to 
elegantly describing diffusive motion, the Wiener process is used to model nearly any 
systems interacting with white noise. Chap. 2 already found an application outside 
of diffusive motion, that is the statistics of the quadrature Qt and Pt under vacuum 
expectation. This section reviews of the properties of the Wiener process, including 
how it relates to a classical probability space introduced in Sec. 3.1. 

The defining characteristics of a Wiener process are two fold: 

1. A Wiener process makes a continuous trajectory in time, with probability one. 

2. A Wiener process has increments that are independent, mean zero, Gaussian 
distributed random variables with a variance given by the increment's time 
duration. 

The first property is obvious while the second is a bit more involved and requires 
some explanation. Consider the stochastic process {wt}f^Q. For times < s < t, 
we can define the random variables, a = — wq and b = Wt — Wg- If {wt}t>o 
is a Wiener process then a and b are statistically independent and a is a mean 
zero, Gaussian random variable with variance s and b is also a mean zero Gaussian, 
with variance t — s. Sec. 3.1.1 found that if a random process has statistically 
independent increments and each increment is mean zero then its a martingale, and 
so the Wiener process is also a martingale. There are many more interesting and 
sometimes nonintuitive properties that one can calculate for a Wiener process, see 
[58, Chap. 2]. Some interesting facts are {i) a Wiener process is nondifferentiable 
with probability one and (ii) if at some time t a Wiener process takes on the value 
w{t) — a then it will take on that value and infinite number of times in every interval 
[t,t + At]. 
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The statistical properties of a Wiener process are deceptively simple, and yet 
exceedingly rich. The second defining property allows us to find a connection be- 
tween this simple statement and a nontrivial probability measure over the space of 
continuous functions. While we just used the times < s < t to demonstrate what 
this property means there is nothing stopping us from using a countable sequence 
of times < ti < t2 < ■ ■ ■ ■ We then know that for a Wiener process the random 
variables Awj = Wti—Wti_^ are all independent mean zero Gaussian random variables 
with variances Atj = it — ti-i- For each time ti we can calculate the probability that 
the Wiener process lies in the interval /j — (a^, bi). This ultimately turns out to be 

. /.}) ^ /_ rfw, dw. . . . n (^e---^) , (3.31) 

which is known as Wiener's discrete path integral [58]. Notice, that by picking the 
sequence of times < ti < t2 < • • • and the intervals = {ai,bi), wc just defined 
a cylindrical set, C{ti,t2, ■ ■ ■ ; Ii, I2, ■ ■ ■)■ Eq. 3.31 is a probability for observing a 
continuous trajectory to lie within this set, and therefore we can use these integrals 
to define a probability measure on the space of continuous functions uj : R"*" — > R. 
Not surprisingly, this is called the Wiener measure 

P(cit^,t2, ... -,11,12, ...))= P{{wu e Ii}). (3.32) 

It is worth noting that under this measure, all trajectories which do not have a;(0) = 
are given zero probability, i.e. 

F(^{iu en : cu{Q) = 0}^ =1. (3.33) 

This brings us back to the probability space {fl, T , P) defined over the continuous 
functions uif) with the cr-algebra T over the cylindrical sets. If the probability 
measure P is the Wiener measure as defined above than the stochastic process, 
{yjti^jj) = u}{t) : <t < 00} is Wiener processes. 
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3.2 Quantum Probability Theory 

These same concepts can also be applied to quantum theory either directly or with 
some modification. The mathematics of quantum stochastic calculus and noncom- 
mutative probability theory is a broad and detailed subject, one that is beyond 
our scope. Reasonable introductions with an emphasis on filtering can be found in 
[25, 61] and with more detailed treatments in [23, 45, 52]. However a certain amount 
of review is necessary in order to address the physical implications of the formalism. 
Before discussing the truly quantum nature of noncommutative probability theory, 
we will discuss its similarities with the classical theory. 

3.2.1 Embedding the quantum into the classical 

This section reviews how to constructs a classical probability space from a set of 
mutually commuting quantum observables. The purpose for this review is two fold. 
First, the quantum filtering problem relies upon this kind of mapping. The contin- 
uous measurements we will be making is described by a set of mutually commuting 
operators which is increasing in time. The eigenvalues that we receive will be viewed 
as a (read mapped to) classical random variables on a classical probability space. 
The second reason for this review is to emphasizes its limitations. In Chap. 5 the 
quantum filter is used to estimate an unknown initial state of a qubit. A natural tool 
in classical systems is the smoother which is an estimate for an unobserved system 
at some past time, given measurements up to some current time. However, naively 
applying this classical technique violates a necessary condition that allows for the 
classical mapping. 

In classical probability theory we found that random variables could be viewed 
as functions mapping elements of the sample space to real numbers. At its most 
practical level, quantum theory is used to predict the outcomes of experiments where 
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the measured observables are represented as Hermitian operators acting upon some 
underlying Hilbert space. The first step in bringing classical probability theory to 
the quantum is to formulate an analogy between classical random variables and 
Hermitian operators. 

classical •<-> quantum 
x{uj) -H- X 

This is a natural analogy, as the basic operation in classical probability is to calculate 
the expectation values of random variables. 

The next association is that in the classical theory we have the probability mea- 
sure P to calculate expectation values while in the quantum we have the system 
state p. This analogy is best illustrated in a discrete example where the classical 
sample space is the set of a finite number of d realizations, Q = {c^i, . . . Ud}. The 
cr-algcbra T for this space is then the power set of Q and the probability measure 
P is completely described by the probabilities of the singleton events Pi = P({a'i}). 
The classical expectation value of a simple random variable x{uj) in this space is then 



d 

E(x) = ^:c(a;,)P({a;,}). (3.34) 

1=1 

In the quantum case a system described by a Hilbert space T-L of dimension d is 
equipped with a positive trace one density matrix p. Expectation values of operators 
X acting on K are of course calculated as 

M(X) = Tr(pX) (3.35) 

which will sometimes also be notated as {X) . Thus we have the correspondence 



classical •<-> quantum 
E{x) O Tr(pX). 
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Rather than this loose analogy, a formal equivalence is possible where certain aspects 
of quantum theory can be embedded into a classical probability space. 

While the classical probability space has a fixed, albeit sometimes abstract set of 
realizations Q, identifying such a set in quantum mechanics is problematic. In the 
spirit of deterministic classical physics, the sample space Q most often represents 
locally realistic fates of the system. The probability of observing certain events 
is given by the probability measure P which act on subsets of Q. The utihty of 
probability theory is that we have an event A — {uj&D,\a< x{uj) < b} which has 
a concrete meaning for multiple random variables, not just x but f{x). In addition 
there could certainly be another random variable y such that when it take on the 
values c <y < d whenever x is observed in the interval [a, b] , and so both correspond 
to the same underlying event A. It is clear from Bell's theorem that this locally 
realistic interpretation of Q is not consistent with quantum mechanics. 

A less ambitious task is to find a classical probability representation that is capa- 
ble of describing the joint statistics of compatible observations. Compatibility of two 
observables X and Y means that [X, y] = and more importantly they share a set 
of eigenvectors {|ej)}. For each eigenfunction we have the projector Pj = | ei)(ej | 
and the operators X and Y have the spectral decompositions 



Note that the eigenvalues Xi and yi need not be distinct as they could have degenerate 
subspaces. 

In a d-dimensional system there are at most d distinct, mutually orthogonal 
projectors associated with a set of commutating operators C — {X, Y, Z, . . .}. If we 
associated projectors {P^, = | A)( A |} so that 




and 




(3.36) 



d 




(3.37) 



A=l 
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and 

d 

E{X) = J2xx^<pPx). (3.38) 

A=l 

The mapping between discrete quantum mechanics and classical probability is to as- 
sociate the set of labels for projectors with the sample space in a classical probability 
space. Then we have Q = {Aj : i — 1 . . .d} and T is simply the power set of Q. 
Prom this assignment, the probability measure is simply the quantum expectation 
value of the associated projectors. For example, probabihty for the event {Ai, A2} is 



P({Ai, A2}) = Tr(p P{,,,,,}) = Tr(p I Ai )( Ai I) + Tr(p | A2 ) ( A2 |). (3.39) 

In classical probability the simplest of simple random variables are the indicator 

functions Xp (for every set F E J-') which correspond to the projectors in the X 1— )■ 
x{u) mapping. This procedure is formalized in Theorem 2.4 of [25] and is summarized 
in Table 3.1. 



Classical 


Quantum 


Q 


{Ai, A2, ...} 




{{Ai}, {Ai, A2}, ...} 


P({A,}) 


Tr(p Pa. ) 




X 



Table 3.1: The spectral mapping between a set of commuting observables and 
a classical probability space. 



The above discussion extends also to the case of infinite dimensional Hilbert 
spaces and operators with continuous spectra [25]. For any "normal" operator^ A, 
which may take on all values in R, there exist a spectral decomposition for A such 

normal operators is one that commutes with its adjoint and so has a spectral decom- 
position. It can be written in terms of commuting Hermitian and anti-Hermitian parts. 
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that 

A= I aP{da) (3.40) 

where P{da) is the spectral measure, also called a projection valued measure, associ- 
ated with A taking on values in interval da. Often explicitly constructing P{da) is 
a little tricky, especially if one must first identify any vectors ip in Hilbert space for 
which (f) = Alp is not well behaved. ( Hermitian operators whose eigenvalues span 
the entire real line, so called unbounded operators, exhibit these kinds of problems. 
A trick for dealing with this case is to compute the spectral measure for the bounded 
operator T — {A + This works because any function f{A) commutes with 

A, therefore they share the same projectors. When T takes on the complex value 
A there is then the corresponding value for A, a = — i.) Once armed with a 
spectral measure for A we can then find an equivalent classical probability model, 
whose sample space are labels for the possible values of A. 

Regardless of whether or not the associate operators are unbounded, we empha- 
size this spectral mapping is only applicable to a subspace of operators which all 
commute with the underlying projectors. While this may seem to indicate that the 
mapping is severely limited, in practice it is extremely useful for describing ancilla 
assisted measurements. If one is interested in computing conditional expectation 
values for operators that commute with the projectors defining the classical space, 
then the quantum-to-classical mapping is still applicable. 

3.2.2 Quantum probability 

The spectral mapping to a classical probability space lacked a representation that is 
independent of the specific choice of projectors. Furthermore, Bell's theorem shows 
that there are no locally realistic sample spaces consistent with quantum mechanics. 
The first step in discussing quantum mechanics in a probability theoretic framework 
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is to omit the sample space fl [62]. At the level of making practical calculations, the 
sample space provided an underlying structure for associating random variables with 
the probability measure. By observing a random outcome, x{uj) — awe were able to 
see what event this corresponds to and then calculate the probability for that event 
given the measure P. In other words, we identify the set of possible realizations that 
are compatible with this observation then evaluate the probability for this event. 

In quantum theory the underlying Hilbert space of the system, T-L, provides this 
necessary structure. By making the association between Hermitian operators and 
the results of experiments we already have the necessary mapping between random 
variables and probabilities. In the above spectral mapping between quantum to clas- 
sical, we associated events with collections of possible eigenvalues and so even in the 
infinite dimensional case, the probability of observing an event is given by the expec- 
tation value of the corresponding union of projectors. In the fully quantum case, we 
need to consider all possible projections, not just those projections that commute. 
The mathematical object that is guaranteed to contain all possible projections is 
a *— algebra (read "star" -algebra) of operators^. Therefore the correspondence be- 
tween the cr— algebra J-' in classical probability space is a *— algebra A of operators 
on H. 

A *— algebra of operators acting on a Hilbert space, H is defined as the set of 
operators A so that 

1. A contains all complex linear combinations of its elements. For aR A, B E A 
we also have C = ciA + C2B e A for any complex coefficients ci and C2. 

2. A contains all adjoints of its elements. A e A imphes that A^ e A. 

3. A contains all products of its elements. A,BeA imphes AB e A. 

"^The * in the name comes from the mathematical convention of using * to represent an 
operator adjoint rather than ^ . 
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4. A contains the identity 1. 

In the finite dimensional case where H — the largest *— algebra acting on H 
is simply the space of all complex n x n matrices. However the reason for 
introducing this algebraic structure is not just for a love of mathematical formalism. 
In the same way that a set of classical random variables generate a cr— algebra, 
a{x, y,z, . . .} (see Sec. 3.1.2) a set of operators generate a *— algebra. For example, 
the above spectral mapping means that there is a *— algebera of operators generated 
by the set commuting of projectors {Pa}- The fact that a *— algebra generated from 
a set of commuting operators still commute means that a commutative *— algebra is 
the set of operators that define the events in a classical probability space. 

When H is infinite dimensional, defining a suitable *— algebra (or sub-*— algebra) 
becomes much more tricky. This is particularly true when trying to show that any 
limiting sequence of operators in the *— algebra is still in the algebra. As one might 
imagine, taking limits of unbounded operators becomes problematic as a sequence of 
operators might converge when acting on one set of vectors, but diverge when acting 
on another. The details of how one solves these issues is beyond our scope. Suffice 
it to say, the solution is to first consider only bounded operators (while keeping the 
T — (A + trick in mind) and then include the limits of all the sequences of 

operators in the algebra that converge on a class of well defined states. The technical 
name for such an algebra is a von Neumann algebra [25]. Generally though, one 
hardly ever needs to apply this kind of construction directly. 

One additional concept that is useful, particularly for discussing a quantum con- 
ditional expectation, is that of a commutant. Suppose you are given a set of operators 
S and you want to know what are the set of operators that commute with S. That 
set is call the commutant and is notated as <S'. To see why this idea is import consider 
the following. Assuming that you are able to measure the operators {Fi, 1^2, ■ ■ ■ , Yn} 
and that all of these operators commute with each other. We just showed how to 
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form a commutative von Neumann algebra from this set, but one might wonder if 
that's the whole story in a quantum to classical mapping. The answer turns out 
to be no. The wiggle room is from the fact that the number of distinct eigenvec- 
tors made from the operators {Yi,Y2, . . . , Y^} may not be equal to the dimension of 
the underlying Hilbert space. In that case, you can find operators A and B where 



One example is in a two qubit system. Suppose you only measure cr^ on one qubit 
but leave the other one alone. Then the projectors | +1 ) ( +1 1 (g) 1 and | — 1 ) ( — 1 1 (g) 1 
form the singleton events in the the classical probability model. But clearly any op- 
erator on the second qubit commutes with these projectors and so there is more 
in this system than is wholly representable in a classical probability system. But 
because the second system does commute with these projectors it is possible to form 
a quantum conditional expectation of the system upon the first. The commutant 
gives you the set of all possible operators that can be mapped onto a classical prob- 
ability space though a quantum conditional expectation. We will briefly discuss this 
mapping next. 

3.2.3 The quantum conditional expectation 

In a classical probability space, if we are given a random variable y, or more generally 
a set of random variables {ys : < s <t}, the distinct outcomes of those variables 
form a set of events. Prom these events we are able to take unions and complements to 
make a cr-algebra, y = a {yg : < s < t}, representing the questions we can answer 
about the model given the observation of the random variables {yg '■ < s < t}. 
Then for every event e y, (assuming P(Aj) ^ 0) we are able to compute the 
conditional expectation value of any random variable x, via Eq. (3.18) 



[A, B]y^O and still have [A, y^] = [B, Yi\ = 0. 



E(x|A) 




(3.41) 



E(XaJ 
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In mapping quantum mechanics on to classical probability theory everything still 
applies, as long as the operators {Yg : < s < t} all mutually commute. If we have 
a projection P;s^ generated from these operators, we can define a quantum conditional 
expectation value, 

This equation shows how crucial that X and commute in order for this equation 
to make sense as a classical analogy. Not only do we need [X, Px] = in order for X 
to be block diagonalized via the labels A, but also in order for this expression to be 
interpretable as a conditional expectation value, we needed to have {X Px) = (Pa X) 
for any A. Classically, we took a conditional expectation value to a conditional 
expectation by multiplying by the "projector" onto the event Ai, sec Eq. (3.19). The 
same is true in the quantum case, as can be seen in a finite dimensional example. 

From a set mutually commuting obscrvables {Yi, Y2, . . . we can form a com- 
mutative *-algebra '3^ that is spanned by the orthogonal projectors {Pa}- These 
projectors form a resolution of the identity so that Pa = 1. For any operator X 
in the commutant of y, the quantum conditional expectation is defined as [25] 

E{X\^)^J2^-^ Pa. (3.43) 

A 

The quantum conditional expectation has a number of properties in common with 
the classical conditional expectation. Specifically, 

E(E(X|^)) - E(X) (3.44) 
M{YiXY2\^) ^YiM(X\^)Y2 ^ Yi,Y2 E ^ (3.45) 
E(X|^) =E(X)1 V X independent of ^. (3.46) 

The quantum conditional expectation has some operator specific properties, 

E(l|?^) = l (3.47) 

E(X'^I^) =E(X|^)^ (3.48) 

E {X^X\^) > 0. (3.49) 
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Finally, an extremely important property that also carries over from the classical 
conditional expectation is that for all F e ^ and X e W 



This property is as important here as it is in the classical case, because it is often 
used to identify^ what the conditional expectation should be when it is intractable 
to find an explicit representation for {Pa}- This is particularly true in the infinite 
dimensional case and then Eq. (3.50) is taken as the defining characteristic for the 
conditional expectation. In other words, if you can find an operator X & '3^ that 
satisfies this equation, then you have the conditional expectation for X given ^ [25]. 

3.2.4 The conditional expectation and generalized measure- 



Before considering a specific example of a continuous-time quantum conditional ex- 
pectation, we briefly pause to discuss the connection between the quantum con- 
ditional expectation and generalized measurements, as traditionally formulated in 
quantum mechanics. We argue here that these two ideas are essentially equivalent 
and will specifically show that any measurement given in terms of a countable set 
of distinct Kraus operators {Mm} is equally well represented in terms of a quantum 
system mapped to a classical probability model. In particular, the posterior state 
p\jn is a Schrodinger picture version of a quantum conditional expectation value. 

A general quantum measurement on a Hilbert space "H is specified by a set of 
Kraus measurement operators {M„} where the indices m label the outcomes of the 
measurement [63] . The measurement operators are required to satisfy a completeness 
relation 



E(E(x|^)y) = E(xy). 



(3.50) 



ments 




(3.51) 



m 



i.e. guess and check. 
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The completeness relation means that {M^M^} is a valid POVM, and in particular 
that under the state p the expectation values, { Tr (pM^M^)} define a probability 
measure for a sample space fl = {1, . . . ,m, . . .}. Upon receiving the outcome m, a 
mixed state p updates to the posterior state p\m via the map 



Tt{pmImj' 



(3.52) 



Our claim is that there exists a Heisenberg picture formulation where the use of 
this posterior state is replaced by a conditional expectation. Proving this is not diffi- 
cult, by using the fact that any generalized measurement is equivalent to a projective 
measurement performed on an ancillary system after an entangling unitary opera- 
tion [63]. The equivalent Heisenberg/quantum probability picture is then to evolve 
all of the operators with the entangling unitary and then calculate a conditional 
expectation value for a post interaction projector. 

The specific relation is that every measurement outcome can be modeled as a 
state in an ancillary system with Hilbert space T-La where there are basis vectors 
\m) that correspond to the outcomes of the measurement. Clearly for this to make 
sense, the dimension of TIa must be as least as big as the number of measurement 
outcomes. The entangling unitary operator U then maps a fiducial pure state |0) to 
the basis vectors |m) and when doing so applies the operator Mm to system. Thus, 
there always exists a unitary U such that for every system state vector ip 

t^lV')|0) = ^M^,|V')|m'). (3.53) 

m' 

Operating the projector 1 | m ) ( m | on the post unitary state, results in 

(l®\m){m\)U\ijj)\0) ^J2^rn'\fp)\m) (m|m') = M^|V')|m). (3.54) 

m' 

In other words by applying the projector \m){m\ to the post interaction state, we 
have applied the measurement operator to the system and projected the ancilla 
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into the measurement eigenstate m. For a general system state p, the probabihty for 
obtaining the outcome m is then given by 



F{m) =Tr i^{U p(S)\0){0\U^) 1 (g) \m){m\j 

= Tr(pO|0)(0| {uH^\m){m\U)) . (3.55) 



Applying a unitary transformation to an operator does not change its spectrum and 
so a unitary evolved projector is still a projector and in this case, one that is no 
longer acting solely on the ancilla. 

The quantum probability description between the generalized measurement with 
operators {Mm} is to use a Heisenberg picture version of the "purification" of that 
measurement. Specifically the commuting set of operators that we are conditioning 
on is simply the unitarily evolved projectors {P^ = W 1 <S> \m){m\U^. 

It is not difficult to show that the partial trace of the posterior state p\m with the 
system operator X is equivalent to the conditional expectation value of the operator 
W{X (8) 1)U conditioned on the projector P^, under the joint state p | )( |. In 
other words we have the equality 



Quantum filtering theory has a particularly grandiose title but in actuality it is not 
much more than what we have already developed here. Boutcn ct al., wrote an award 
winning introduction to the problem quantum filtering and quantum stochastic cal- 
culus [25]. This section does little more than quote their final results. The quantum 
filter is in essence nothing more than the conditional expectation for a system ob- 
servable X, based upon a light observable, e.g. Q\, after both have interacted though 
a unitary Ut- The two light measurements that are typically considered are that of 



^"l^ sys{p\m.X) 



Tr(p® |0)(0| U^{X®1)U Vm) 
Tr(p® |0)(0| P^) 



¥.{U\X ®l)U\Fm) ■ (3.56) 



3.3 Quantum Filtering Theory 
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measuring an output quadrature, e.g. UjQlUt, or a direct photon number measure- 
ment, e.g. Al^Ut. Here we have focused on classical and quantum diffusion, and 
so we will assume that we are measuring the quadrature Ql- In addition, to simplify 
the notation, we assume that we are considering a single field mode and will drop 
the label i. More general expressions are not difficult to derive once the formalism 
is in place; for examples see [64]. 

The quantum filter for time independent system observable X is written as a 
time indexed map 7rt(X) and is the conditional expectation of the unitarily evolved 
operator U}XUt, conditioned on the (continuous) set of measurements of an output 
process {Ytj^^^. In the diffusive case 

Yt = ulQtUt = UliAt + A\)Ut. (3.57) 

When Ut is given by a general single mode, it is the solution to the QSDE given in 
Appendix C Eq. (C.27). In the ID case with no scattering interactions we are able 
to calculate that, 

Yt^Qt+ [ Ul{L + L^)Usds. (3.58) 
Jo 

The general expressions for the unitary evolution of any system operator X and the 
fundamental field processes A^, A]^ and A]^ are given in Appendix C, Sec. C.1.1. 

Sec. 2.5.2 showed that in vacuum expectation, Qt has statistics of a Wiener 
process. Because of this one may be tempted to interpret Eq. (3.58) as the time 
integral of a system operator plus quantum white noise. We urge the reader to 
avoid this temptation because, as Sec. C.1.1 shows, U}{L + L^)Ut is generally a very 
complicated expression involving integrals with respect to dAt, dAf, and dAj. is a 
fully coherent operator acting on the joint Hilbert space T-L (8) ^(f)[o,t]) and does not 
generally commute with Qt- 

It is, however, not difficult to show that Y^ commutes with itself at different 
times, i.e. [Y^, Yg] ~ for any times t and s. Therefore a continuous observation of Y 
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between the times < s < t makes a set of commuting observables {Yg : < s < t}. 
This set of observations can then be used to form a commutative von Neumann 
algebra The quantum filter Trt{X) is then given by the conditional expectation 

ntiX) = ^{UlXUt\^t). (3.59) 

Finding an expression for 'Kt{X) requires implementing the conditional expecta- 
tion in the form given in Eq. (3.50). Note that in general, the conditional expectation 
depends upon the properties of the joint system field state and so you will arrive at 
different filtering equations if the system is in vacuum, [25] , a coherent state [65] , or 
a state with nonclassical photon statistics [66]. The quadrature measurement of a 
single mode in vacuum expectation is arguably the simplest of all cases, and is what 
we will use exclusively here. The bottom line result is that the quantum filter for 
any system operator X is given by the recursive QSDE 

d'Kt{X)^T:t{Coo{X))dt 

(3.60) 

+ (7rt( LtX + XL ) - TTi (Lt + L) (X) ) {dYt - tt* (L + Lt) dt) . 

with the initial condition 7ro(X) = E(X). This is very analogous to the classical 
Kushner-Stratonovich equation of nonlinear filtering [25] . The operator map 

Cqq{X) = +i[H,X]+L^XL-lL''LX -IX L'fL (3.61) 

is the 00 Evens-Hudson map, (see Sec. C.1.1) and is essentially the Heisenberg picture 
version of the Lindblad master equation. A serious draw back to the quantum filter is 
that because it is recursive, it will very rarely close. In order to propagate Eq. (3.60) 
for the operator X, we need to also calculate in parallel the filter for the operators 
A = L'' + L, B = L'^X + XL, and C = £oo(-^)- It's also highly likely that the space 
of possible system operators is not generated by simply these four operators. By 
calculating 7rt(A), we will also need to know the filter for 7rt(£oo(^)), which itself will 
likely generate more complicated operators. Fortunately a saving grace is that we 
can invert this equation to find an effective "noisy" system operator pf. The equation 
of motion for pt is the conditional master equation which will discuss in Sec. 3.4. 
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Before doing so, we would like to highlight one important issue that makes a 
strong distinction between quantum and classical filtering. In the classical case the 
filter is one of a couple of operations that one is interested in computing condi- 
tioned on an observation process {yt}- Another process that one is interested in is a 
smoother, which is defined classically as 

7rs,t{x) = lS{xs\ {yt' ■0<t' <t}) for s<t. (3.62) 

Classically this is a perfectly well defined thing to do, as long measurable 
with respect to the cx-algebra defining the global probability space (O, J-", P). One 
would generally then be tempted to define a quantum mechanical smoother, 

7rs,t{X) = M{UlXUs\^t) iors<t. (3.63) 

Unfortunately this object is not well defined for any system operator, X, because 
U^XUs is not in the commutant of To see why, consider that Yt = Qt + Jq UJ.{L + 
L'^)Urdr, which certainly has support upon the system Hilbert space via the time 
integral of U}{L + L^)Ur. There is no guarentee that [UlXUs, Ul{L + L^)Ut] for any 
X and times t, s. The reason that UjXUt is in the commutant of ^ is because we 
can show that U^QsUg — U^QgUt, for s >t. This property then shows us that 

[U}XUt, n] = [U}XUt, UlQMt] = U}[X, Qspt = (3.64) 

for s < t. This means that the post interaction system operator at time t is able 
to be conditioned on past measurements. However the same "advancement" trick is 
not possible for the system observable, and therefore there is no guarantee that Eq. 
(3.63) is well defined. If you simply threw caution to the wind and went through 
a smoothing calculation, even though U^XUs is not in the commutant of then 
it quite possible that by conditioning you could take positive operators to negative 
ones or even Hermitian observables into non-Hermitian operators [25]. 

Tsang proposed a time-symmetric quantum smoother where one calculates a 
smoothing operation for a classical signal imprinted on a quantum system [67]. In 
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this case, the smoother is calculating a conditional estimate for the classical signal 
and therefore commutes with both the system and field operators. In Chap. 5 
we wish to form an estimate for the system state at the initial time t = given 
measurements up to time t. One might be tempted to try and formulate a quantum 
smoothing equation E(X|^), but as we just showed such an object is not in general 
well defined. Therefore we have to resort to different methods. 



3.4 The Conditional Master Equation 

Sec. 3.3 just showed how one could form a conditional estimate for system oberv- 
ables based upon a measurement of an output light quadrature via the Heisenberg 
picture formalism of quantum probability. A serious drawback is that the filtering 
equations are recursive and hardly ever close. The saving grace of this is to convert 
to a randomized Schrodinger picture and work with a Conditional Master Equation 
(CME). 

We know from Sec. 3.3, that every commutative space of operators is mappable to 
a classical probability space. We also know that from the definition of the conditional 
expectation, the filter fit^X) = Ei{U^XUt\'3^t) is an operator in And so if we 
generate a classical probability space (fi, J-", P) for then the filter vrf(X) should 
be representable in that space. Furthermore in a given experiment, the eigenvalues 
we receive from measuring Y form a realization of a classical stochastic process 
defined on that probability space. 

What this means in practice is that we will now focus our attention to solely 
system variables a treat the measurement record yt as a classical stochastic process. It 
is in this sense that we call the conditional master equation a semiclassical equation. 
Specifically, it treats the output measurements {5^}(>o as a classical random variable 
while the system undergoes a noisy quantum evolution. In our opinion, it cannot 



Chapter 3. Classical and Quantum Probability Theory- 



US 



be over emphasized that this process has its origin as a quantum object and so not 
every operator will commute with Yt - particularly past system observables. 

With that warning to tread lightly, finding a semiclassical equation for a noisy 
system state pt is remarkably easy. Such an equation begins by enforcing that for 

every system operator X, we must have^ 

Tr{ptX)=nt{X). (3.65) 

To find an SDE for pt, we simply notice two things. In every term of Eq. (3.60), 
there is a coefficient TitO^) of some operator Y which is in turn relatable to Tr{ptY). 
The second is that the only quantum stochastic differential in Eq. (3.60) is dYt, 
which from Eq. (3.58), satisfies the quantum Ito rule, 

dYt dYt - dt. (3.66) 
Therefore in the semiclassical mapping dyt also has the Ito rule 

dytdyt = dt. (3.67) 
With these two observations we have, 

TT{dptX)=TT{ptCoo{X))dt 

+ {Tr {pt {L^X + XL)) -Tr {pt {L^ + L)) Tr {ptX) ) {dyt-Tt{pt{L^ +L)) dt). 

(3.68) 

We can use the cyclic property of the trace to decompose £oo(^) into an adjoint 
map acting on pt, 

TT{pt£oo{X)) = Tr {{-t[Ht,pt] + LptL^ - '^L^Lpt - \ptL^L) X) . (3.69) 



''Mathematically, this equivalence may seem strange as the left hand side is a scalar 
valued random variable while the right hand side is an operator in The equivalence 
is made though the classical outcome w, that labels the set of eigenvalues we receive from 
the measurement. 
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By making the same kind of transformation of the remaining terms and noting that 
it is true for any system operator X, we arrive at the conditional master equation 
(CME) 

dpt = -i[Ht, pt] dt + V[L]{pt) dt + H[L]{pt) dvt, (3.70) 

with the initial condition is po = p(0) and we made the following definitions. V[L]{pt) 
is the Lindblad operator map commonly found in open quantum systems and is 
defined as 

V[L]{pt) = Lp,L^ - iL^Lpt - IptL^L. (3.71) 

l-L[L](pt) is the state update map defined as 

n[L]{pt) = Lpt + ptL''- Tr{{L + Lt) p,) p^. (3.72) 

This map shows how the state updates, weighted by the strength of the stochastic 
process, 

dvt = dyt - Tr{{L + L^) pt) dt. (3.73) 

The random process Vj, called the innovation process, plays an important role as 
it is the only random contribution to the CME. In the next section we will review 
the proof that when everything about the measurement yt is properly specified, then 
dvt is a realization of a Wiener process. 



3.4.1 The innovation process 

Here we will show that in the innovation process Vt transforms yt into a Wiener pro- 
cess by subtracting off the conditional expected mean. In classical probability. Levy's 
theorem is an important result because it gives necessary and sufficient conditions 
for showing that a given process is in fact a Wiener process. Roughly stated, if a 
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stochastic process rrit is a "local martingale" and obeys the Ito rule that {drritY = dt 
then it must be a Wiener process [25]. Martingales arc an important kind of stochas- 
tic process that play a crucial role in classical probability theory (see Sec. 3.1.1). In 
essence it is a random process where the conditional mean of any future increment 
is zero [22, 53]. 

The proof that Vt is a Wiener process is given in theorem 7.1 of reference [25] 
and relies on some fundamental properties of the conditional expectation. We quote 
this result in Lemma 3.1, for two reasons. The first is simply because it is easily 
shown and is a rather elegant result. The second is that Chap. 5 uses the fact 
that Vt is Wiener process only when pt is "consistent" with the actual statistics of 
{yt}t>o- Here consistency means that the correspondence Tr(piX) = TTt{X) holds in 
the sense that 7rt(X) is a conditional expectation of X with respect to under the 
true quantum state. If pt does not exactly match 7rt(-) because its initial condition 
is wrong or any number of other approximations, then vt will not generally have the 
statistics of a Wiener process. See Sees. 5.4 and 5.5 for further discussion. 

Lemma 3.1. In vacuum expectation, the quantum stochastic process Mt = Yt — 
Jq Tis{L + L"^) ds is an instance of a quantum Wiener process in that its finite dimen- 
sional statistics are independent mean zero Gaussian random variables with variances 
equal to the time differences. 

Proof. In Sec. 3.1.1, we review that the classical definition of a martingale is that 
it satisfies the property Ei{mt\^s) — for s < t. In the quantum case this is 
equivalent to showing that E( (Mt — Ms) |^^) = 0. The reason for this is because the 
conditional expectation obeys the property that for every K G IE( K |^) = K. 

By the definition of the conditional expectation, we have that for every X e ^ 



E (E(Mi -MsmK ) = E ((M, - M,) K ) 



(3.74) 
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Substituting the definition of Mt, 

E {{Mt -M,)K) = E {{Yt -Ys)K)- E(^ ds' n^L + L^)K). (3.75) 

Notice, however, that 7rs'{X) = E(C/],XC/s/|^/) we can again use the definition of 
the conditional expectation to convert the second term into an expectation of an 
integral of Ul,{L + L^)Us'- In Eq. (3.58) we solved for Yt, and found that Yt = 
Qt + Jq ds' Ul,{L + L^)Us'- After substituting that solution, Eq. (3.75) simplifies to 



M{{Mt-M,)K) = M{{Yt-Y,)K)-MQ^ ds' U], {L + Lt)[/,, K 
^¥.{{Qt-Qs)K). 



(3.76) 



Any operator X e ^ is an operator which acts on the system Hilbert space and the 
Fock space associated with hght operators defined for times s' e [0, s). The operator 
Qt — Qs acts on light field states defined on the time interval [s,t\. This means that 
this final expectation value factorizes to show, 

E {{Mt -M,)K) = E(gt - Q,) E(K) = 0. (3.77) 

This is zero because the quadrature operator Qt is mean zero in vacuum, and so Mt 
is indeed a martingale, when we condition on The proof is finished by simply 
observing that dMtdMt = dt and so Mt is a quantum Wiener process by Levy's 
thoerem. □ 



3.4.2 The Ito correction in the conditional master equation 

The quantum filter 7rt(-) is given by an Itb form quantum stochastic differential 
equation and therefore the conditional master equation is a semiclassical Ito equation. 
In addition to an Ito integral, there is also a Stratonovich integral where the rules 
of standard calculus still apply, but the statistical properties are more subtle (see 
Appendix B for their respective definitions) . While the two forms of integration are 
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distinct, they are related by a conversion formula, resulting in the "Ito correction", 
derived in Appendix B.1.1. In Chap. 4, we are required to work with a conditional 
master equation written as a Stratonovich integral and so we derive this conversion 
here. 

For a general measurement operator L and Hamiltonian H, the conditional master 
equation is 

dpt = -i[H, pt] dt + V[L]{pt) dt + H[L]{pt) dvf (3.78) 

The first two terms are simple deterministic integrals and are unaffected by the 
choice of stochastic integral and can be ignored. The integrand in Ito integral is the 
conditioning map and for reference is, 

UMpt) = Lpt + ptL^- Ti{Lpi + piLt) p,. (3.79) 

For the remainder of this section we will suppress the parameterizing argument and 
simply write T-L{pt)- 

A one-dimensional Ito integral that is typically considered in an Ito- Stratonovich 
conversion has a differential 

dxt — b{xt)dwt, (3.80) 

for a smooth integrand b{x). When written as a Stratonovich equation, this differ- 
ential is notated as 

dxt — b{xt) o dwf. (3.81) 

The Ito correction is what results when you enforce that both integrals must give 
the same process Xt, and the final result is that 



b{xt) o dwt = b{xt) dwt + | ^(^«) K^t) dt. (3.82) 



The additional term is known as the Ito correction. 
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To immediately apply this result to the conditional master equation would involve 
defining what it means to take the derivative the 'H{p) operator with respect to p. 
Rather than defining a calculus of super-operators, we will return to the roots of the 
relation (see Appendix B.1.1) and write the correction as 

di = n{pt) o dvt - n{pt)dvt = {n{pt + 1 dpt) - n{pt)) dvt. (3.83) 

The map "H is unfortunately not a linear operator in pt and so the integrand on right 
hand side is not simply H^^dpt). After a little algebra we find that 

dI^l(^L dpt + dpt - Tr{Lpt + ptL^)dpt 

- Tr(L dpt + dpt L^) (pt + \ dpt) ) dvt. (3.84) 

To simplify this expression into a final form, we will substitute the Ito equation 
expression for dpt and apply the Ito rule that dvtdvt = dt with all other differential 
products being zero. This means that when substituting dpt we need to only use 
the stochastic term as any deterministic term will result in a product dtdvt = 0. 
Furthermore any term with two powers of dpt will also be zero as that will result in 
three powers of dvt- The simplified expression is then, 

dl = \ {LH{pt) + n{pt) Lt - Tr ((L + Lt) n{pt)) pt - Tr((L + Lt)p,) H{pt) ) dt 
= Ic[L](pt) dt. 

(3.85) 

Substituting the definition of 'H[L](pt), the ltd correction map, Ic[L]{pt), simpli- 
fies to 

Ic[L]{pt) = {LptL^ + iL'pt + \ptL'^'') 

- ((LtL) + i(L^) + i(Lt^))p, (3.86) 

- (L + Lt) {Lpt + ptL^ -{L + L^)pt) 

where {X) = Tr(Xpt). 
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Ultimately the Stratonovich form of the conditional master equation is then given 

by 

dpt = -i[Ht, pt] dt + V[L]{pt) dt - Ic[L]{pt) dt + n[L]{pt) o dvf (3.87) 
3.4.3 The conditional Schrodinger equation 

In this chapter we have focused solely on the interpretation of quantum mechanics 
in terms of probability spaces. That description lead to a quantum conditional ex- 
pectation and a quantum filter, which is described in a Heisenberg picture. Prom 
that Heisenberg picture description we found a conditional master equation (CME). 
When the state of the system is pure and the dynamics are such that it will re- 
main pure, then propagating a full density matrix is unnecessary and a conditional 
Schrodinger equation (CSE) is sufficient. Chap. 5 uses this fact for computational 
efficiency and therefore we include the general expression for a CSE based upon the 
CME in Eq. (3.70). The details of the conversion can be found in [37]. 

The CME gives the evolution for pt in terms of an Ito differential dpt, Eq. (3.70). 
Any density matrix whose purity is 1 can be represented by an outer product of a 
normalized state vector in Hilbert space [63] , 

pt = \ijt){^t\ if and only if Tr{p^) = 1. (3.88) 

Furthermore \ipt) is unique up to an arbitrary constant phase. While we have worked 
quite hard to derive the CME and give it physical meaning, practically speaking it 
is "nothing" more than a matrix valued stochastic differential equation defined on 
a classical probability space. Therefore a method for moving from a CME to a 
CSE is to hypothesize the existence of a random state vector j'^t) satisfying some 
vector valued SDE djV't) and then solve for the differential that give the differential 
d{\iJt){A\) equal to the CME. 



Chapter 3. Classical and Quantum Probability Theory- 



ISO 



We note that this is not a standard derivation. Typically in quantum optics, one 
first derives a stochastic Schrodinger equation via an unraveling of a master equation 
that considers photon counting and then takes a diffusive limit [51]. Having already 
developed the CME from the quantum filter, it is much simpler to perform the 
above calculation, rather than including an independent derivation. The resulting 
equations are identical. 

The derivation of d\il^t) is not difficult as we can see that the only random process 
that enters the Eq. (3.70) is through the innovations Vt and it does so linearly. We 
also know that Vt satisfies the Ito rule, dvtdvt — dt. Therefore a reasonable form for 
d\ilJt) is 

= At\A) dt + Bt iV't) dvt. (3.89) 

for some time-adapted but possibly state dependent operators At and Bf. The adjoint 
of this equation is then 

d{iPt\ = {MA dt + B\ dvf (3.90) 
And so we need to solve for At and Bt subject to the constraint, 

dpt = d\iJt) {^t\ + \^t) d{i;t\ + d\'il^t) d{A\- (3.91) 
Doing so is not too difficult and the operators turn out to be 

At = -iHt - ^ (LtL - 2 (Lt> L + (Lt> (L)) (3.92) 

and 

Bt^L-{L). (3.93) 

Traditionally the operator L is Hermitian, and so if we choose our favorite example 
of L = \/k X then 



d\xlJt) ^(^-iHt-\K (J, - {J,)f ) \i^t) dt + V^ (J, - (J,) ) iV't) dvt (3.94) 
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Chapter 4 



Projection Filtering for Qubit 
Ensembles 



This chapter derives an approximate form for the conditional dynamics of an ensem- 
ble of n qubits under the assumption that the state will remain nearly an identical 
separable state. We assume that the system is undergoing a diffusive measurement 
of the collective angular momentum operator while simultaneously experiencing 
strong global rotations. The approximation is made by formulating a projection fil- 
ter from the exact conditional master equation. The projection is made through the 
technique of orthogonal projections in differential geometry. Here we identify the 
space of identical separable states as a Riemannian manifold and then project the 
conditional master equation into its tangent space. We also review the elements of 
differential geometry that make such a mapping possible. Finally we test the accu- 
racy of the projection filter numerical by comparing it to simulations of a stochastic 
schrodinger equation. We find that it matches the conditional mean spin projections 
to within a 5% RMS error. 
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4.1 Introduction 

Numerical integration of a conditional master equation is generally a resource inten- 
sive exercise. Specifying a general mixed state for a d-dimensional quantum system 
requires (P — 1 real parameters. Furthermore the total dimension of a many body 
system grows exponentially. A system of n qubits generates a 2"-dimensional Hilbert 
space, requiring 2^" — 1 parameters. This "curse of dimensionality" is true even in 
a unconditioned system and so physicists often search for symmetries that allow for 
a more efficient description. The nonlinearity in the conditional master equation 
means that a number of symmetries that are often preserved in an uncondition map 
are no longer exploitable. 

A projection filter is a tool that was developed in the context of classical filtering 
theory and provides a general method for constraining nonlinear estimators to remain 
in a lower dimensional space [31, 32]. Within the past decade these tools have 
also been applied to quantum systems, specifically for cavity QED systems [33- 
36], collective spin systems [37], and low rank approximations for general master 
equations [38]. The fiexibility of the projection method is provided by its formulation 
in the language of differential geometry. In the quantum framework we have a high, 
possibly infinite, dimensional manifold representing the space of possible states. It 
is often the case that the system is initialized in a state with a large amount of 
symmetry thereby initially allowing an efficient, lower dimensional representation. 
The project filter modifies the exact evolution in such a way as to constrain the 
system to remain in the lower dimensional submanifold. It does so by projecting the 
differential into the lower dimensional tangent space. 

Here we focus on an ensemble of n qubits initially prepared in an identical tensor 

product state. In other words, the total state of the system ptot is initialized in as a 
n-fold tensor product of a single qubit state p, 

Ptot = P^" (4.1) 
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Clearly this is a highly symmetric and easily represented state, as a single qubit 
state requires only 3 parameters to be specified uniquely. If the master equation 
acts on each qubit individually then the total system will remain in an identical 
separable state for all future times. However for a joint qubit system undergoing 
a weak, diffusive measurement of the collective angular momentum variable J^, the 
conditional master equation is generally entangling. In the long time limit, this kind 
of measurement most often results in the system projecting into a nonseparable Dicke 
state. 

In this chapter we demonstrate, through numerical simulation, that if the system 
also undergoes strong, randomized rotations in addition to the collective measure- 
ment then the system will remain nearly sparable. Under this assumption that this 
is the case, we apply the technique of projection filtering to the conditional master 
equation so that it maps identical separable states to identical separable states. 



4.1.1 An introduction to differential projections 

The general technique of differential projections can be understood though the fol- 
lowing example. Consider an ordinary scalar function defined on three dimensions, 
f{x, y, z). The chain rule shows that the differential for / is 

df = ^dx + ^dy + ^dz. (4.2) 
ox ay oz 

Suppose that we have a particle with position vector x(t), and at each time t we 
evaluate f{x{t),y{t), z{t)). In order to have a complete description for / we clearly 
need to keep track of all three components because a change in x, y or z induces a 
change in /. Now suppose that keeping track of z is too much of a hassle and we 
are only interested in tracking x{t) and y{t). The question posed by the projection 
filter is, "how should we modify / so that we only need to track x and y?" The 
answer comes from the fact that ii ^ — everywhere then / doesn't change with z 
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and ultimately z can be ignored. Therefore the modification we should make it set 
the gradient of / to point only in the xy-plane, i.e. set ^ = 0. This modification is 
the differential projection of /. Therefore we have a modified function f\{^^y}, whose 
differential is simply, 

df df 

The difficulty in forming a projection filter is that / is not usually written in terms 
of, z}, but instead some other set of parameters, {x'^y'^z'}, or even just t. 

Furthermore the desired subspace might be some complicated 2D surface with pa- 
rameters V and w. It is very likely that v and w may not even be orthogonal, at least 
not in the same sense x, y and z are orthogonal. The first challenge in developing a 
projection filter is to give the desired objective a geometric interpretation. 



4.1.2 The conditional master equation 

Before embarking on a description of the geometry of quantum states, we will first 
collect all of the necessary equations from previous chapters here for a single point 
of reference. Sec. 3.4 found that the state of an atomic system conditioned on 

a continuous diffusive measurement is easily represented by the conditional master 
equation (CME) given by the ltd differential, 

dpt = -i[H, pt]dt + V[L\{pt)dt + H[L\{pt)dwt. (4.4) 

(See Appendix B for a review of classical stochastic differential equations.) The 
dissipation and conditioning maps, X'[L](-) and 'H[L](-), are parameterized by the 
measurement operator L and are defined as 

V[L\{p) = LpL^ - iL^Lp - IpL^L (4.5) 

and 

UMp,) = Lpt + ptL^ - Tr((L + Lt)p,) p,. (4.6) 
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Often we will omit the parameterizing argument and simply write 'D{pt) and 7i{pt)- 
Also note that h has been set equal to one, so that the Hamiltonian operator H has 
units of frequency and the measurement operator L has units of root frequency. 

Note that Sec. 3.4 used a slightly different notation, referring to the innovation as 
dvt rather than dwt. Sec. 3.4.1 showed that innovation computed from the measure- 
ment record yt has the statistics of a Wiener process, when the initial condition po 
coincides with the "true" initial state. In Chap. 5 this will not be the case, however 
here we are assuming that the initial condition is known, and in particular, that it 
can be written as p®". Therefore, throughout this chapter we will consider the in- 
novation to be Wiener process and write it as dwt. Sec. 3.1.4 reviews the statistical 
and defining properties of the Wiener process. 

The physical system that we have in mind is the idealized linear Faraday inter- 
action in Sec. 2.7, meaning the measurement operator is 

L = v^J, (4.7) 

where k is a constant rate. In addition to this measurement, we consider applying a 
uniform but time varying magnetic field, leading to the Hamiltonian 

H = nt)j, + p{t)jy + nt)j,. (4.8) 

The control fields p{t) are assumed to be real valued, deterministic functions of 
time^. 

For reasons made apparent in Sec. 4.2.4, we will also need to work with the 
Stratonovich form of the CME, 

dpt = -i[H, pt]dt + V[L]{pt)dt - Ic[L]{pt)dt + n[L] o dwf (4.9) 

The conversion from the Ito form generated the Ito correction map, derived in Sec. 

^In Chap. 5 the control fields are written as b{t), however in this chapter the coordinates 
6* indicate the projected coefficients for the stochastic terms, so here we use /*(i) instead. 
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3.4.2, which is 

Ic[L]{pt) ^LptL'^ + |LVt + \ptL'^'' 

- {{L^L) + \{L^) + \{L^-))p, (4.10) 

- <L + Lt> (Lpi + ptLt-(L + Lt> p,). 

Here the expectation value of the operator X has been written as {X) = Tr {pt X). 



4.2 Differential Manifolds 

A manifold Ai is most generally a continuous set of point that can be locally mapped 
to a d-dimensional Euclidean space. In a neighborhood of any point in A4 we can 
define a smooth mapping points in that neighborhood to a fiat space of dimension 
d. How smooth this mapping needs to be, often depends upon the author and 
the context, generally it must be smooth enough so that the tools of differential 
calculus can be applied. The concept of smooth is quite at odds with the random 
nature of Brownian motion, as the Wiener process is provably nondifferentiable with 
probability one. Here we will be ultimately considering random trajectories on a 
differential manifold. The resolution between these two conflicting notions is that 
while a diffusive trajectory is nondifferentiable, it is a trajectory in a smooth space. 
i.e. a two-dimensional Brownian motion is not a differentiable curve, but it is defined 
on a 2-D plane which is smooth. 

The specific manifold we need is the space of all vahd density operators for n 
qubits. For a single qubit, the Bloch vector defines a perfectly respectable one-to- 
one mapping between a quantum state and the 3-dimensional Euclidian ball. The 
conditional master equation then has a representation as a diffusive trajectory within 
the Bloch ball. Defining an equivalent representation for a d— dimensional quantum 
state is nontrivial and is still the subject of current research. While there does exist 
an equivalent mapping to a ball living in a 2*^— 1-dimensional space, the boundary and 
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smoothness of this mapping is quite complex and not well understood [68]. Here we 
will only be interested in a geometric representation of states that can be written as 
n copies of a single qubit state. Ultimately a Bloch vector representation is sufficient 
for our purposes. 

4.2.1 Tangent spaces 

The differential projection we ultimately want to preform requires a deeper under- 
standing of how to define a gradient in a more abstract setting. A key conceptual 
point is that we make an association between basis vectors in ad dimension space and 
the partial derivatives we can take of a function defined on the manifold. Specifically, 
a point p in the manifold A4 is representable by the coordinates {x^, x'^, . . . , x*^}. Any 
smooth function / on the manifold, evaluated at this point p can therefore also be 
represented as a function of these coordinates, f{x^,x^, . . .x'^). At the point p the 
partial derivative of the function / with respect to the coordinate defines the rate 
of change of / as is varied, i.e. it defines a line tangent to / pointing in the 
direction of x*. 

The relation between partial derivatives and vectors can be formed by associating 
the basis element with the partial derivative operator Differential geometry 
is concerned with defining structures that are independent of any given coordinate 
system. Calculating a partial derivative with respect to a different coordinate system, 
{y*}, is easily accomplished by applying the usual chain rule. 



The coordinate independent quantity here is the space of all possible partial deriva- 
tives we could take at this point. At first glance this may seem like a rather large 
object, however the chain rule just showed that a partial derivative in one basis is 
simply a linear combination of partial derivatives in another basis. Therefore the 



d dx^ d 
dy^ ~ dy^ dx^ ' 



(4.11) 
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space of all possible partial derivatives is simply the linear span of partials taken 
with respect to some basis. This space is called the tangent space of A4 at point p, 
denoted by, 



Note that the tangent space is a rf- dimensional vector space, as we are taking lin- 
ear combinations of d basis vectors. Often we will discuss a directional derivative, 
meaning that we will be taking a derivative in the direction of another point in the 
manifold. But as this could have any relation to a given coordinate system, the di- 
rection derivative defines a vector in the tangent space. Another useful bit of jargon 
is that if you have a tangent vector defined for ever point in the manifold then this 
defines a vector field. 

4.2.2 Riemannian Metrics and orthogonal projections 

The tangent space TpA4 defines the set of all possible partial derivatives one could 
make at the point p. However it does not describe how those derivatives are related. 
While in a Cartesian basis we have a sense that e^: is orthogonal to By, in general its 
hard to tell how the arbitrary vector e„ is related to e^. The missing element is a 

metric, (■, describing a positive definite inner product between any two tangent 
vector fields. At each point p we can take the dot product of e„ and e^, to see how 
they are related. If the space is Euclidean, then the metric well report the fact that 
is orthogonal to e^^, which is not true in general. 

A Riemannian manifold is a manifold Ai that is equipped with a metric that 
varies continuously between different points. While in a Euclidean space the inner 
product between two vectors doesn't change between different points, this is not 
true in a general space leading to much richer geometries. For a basis of vectors {ej} 
spanning the tangent space TpA4 the metric at that point can be written as a d x d 




(4.12) 
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matrix with components, 

gij{p) = {ei,ej)p. (4.13) 
In addition to being positive definite, a metric is also symmetric in that (Cj, 6^)^ = 

A metric gives a notion of two vectors being orthogonal and from that we are 
able to make an orthogonal projection. This is crucially important as we wish to 
project the conditional master equation into the tangent space of states that are n 
copies of a single qubit state. For a Euclidean space, the orthogonal projection of 
the vector v — v^ei + v^e2 + v^e^ onto the XY plane is trivial to compute, as it 
simply discards the 63 component. Given a metric and a general manifold we can 
make a similar formulation. 

Suppose for a Riemannian manifold A4 we have a submanifold Af C. A4 oi dimen- 
sion n < d. Without explicitly constructing an orthogonal basis for every tangent 
space TpAi, we would like to find a map that discards the vector components orthog- 
onal to TpAf. This can easily be done, given a basis of vectors {vi : i = 1, . . . ,n} 
that span TpAf. The metric (•, ■)p taken from Ai, can equally well be applied to 
J\f as their tangent spaces overlap. Applying this metric to {vi} we have the n x n 
matrix with elements 

gi,{p) = {vi,v,)p. (4.14) 

As the metric is positive definite, this matrix is invertible whose entries are often 
written as, g'^^{p) = {g{p))-j^- We can now show that that the map Uj^ : TpM. — >■ 

nM-9'Kp){vj,-),Vi (4.15) 

is equivalent to discarding the component of w orthogonal to TpN . 

The projection map should operate as the identity for any vector u e TpM. 
To check that this is true, TpM = spanl^j : i — 1 . . .n}, and so u be written as 
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u — u'' Vk ior some coefficients m^. Tlien we liave 

(4.16) 

^u" g'^{p)gjk{p) Vi 
= u^5i Vi = u. 

sliould also return zero for every vector ortliogonal to TpM. Tliis is also easy to 
check, as for every v-^ in the orthogonal complement of TpM, we have {y, v-^)^ — 0, 
if V e TpAf. Therefore, 

^m{v^) = g'^ip) {vj, v^)^ Vi = 0. (4.17) 

But as TpM = TpM U {TpMY, Uj^ is the correct mapping. 

Note that Eq. (4.15) required only specifying the metric gij{p) on the submanifold 
M and does not require an explicit representation for tangent vectors outside of this 
subspace. This is the reason why it is not necessary to find an explicit mapping 
between the space of n-qubit density matrices to a 2*^ — 1-dimensional Euclidean 
space in order to use the projection filtering methods. All we need is the valid metric 
for density matrices and a spanning set of tangent vectors in the submanifold we 
wish to project onto. 



4.2.3 Differentials on abstract manifolds 

As the conditional master equation is written in terms of stochastic differentials, 
we must see how a differential operates in a geometric context. In multivariable 
calculus the fundamental object is the differential of the coordinates, e.g. dx, dy, 
etc.. In a more abstract space, its difficult to intuit what the differential means. 
For instance how would one define a differential of a matrix, say the Pauli matrix 
CTj;. Would it be the differential of its entries, the differential of its eigenvalues or 
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maybe even a differential of both the eigenvalues and eigenvectors? The solution 
to this problem is to consider the differential not the individual points themselves, 
but the differential after the application of a smooth map to the Euclidean space. 
The differential in the abstract space is then inferred from the Euchdean differential. 
This process of inference is called the pullback, in that you are pulling back from the 
original mapping. Our ultimate goal is to interpret the conditional master equation 
dpt in the language of differential geometry and so we need to understand how it 
relates to a Euclidean mapping. 

Basic multivariable calculus shows that the total differential of the scalar function 
/ is given by, 

df = ^dx\ (4.18) 

A differential can also be view as a linear map acting on tangent vectors. The action 
of dj on the tangent vector ^ is defined to be 

^{^-%- ("^) 

While this may seem a bit obtuse at first, it is actually a very useful concept. To 
see why, consider the most basis function we can consider, namely the coordinate 
function x^. The differential dx^ has an action on the basis vector 

This shows that the coordinate differential is biorthogonal to and therefore 
can be thought of as a dual basis vector. When defining the tangent space in Sec. 
4.2.1 we found that the partial derivatives spanned that space and the coordinate 
transformation coefficients were simply linear expansion coefficients. The same is 
true for the differential dj in that ^ are the expansion coefficients in a dual space 
spanned by the basis vectors dx^ . The dual space is often called the cotangent space. 

A differential of a function between two spaces can also be defined. While we just 
considered the differential of a scalar valued function, /, we can also consider the 
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differential of a vector, matrix, or operator valued function. When df acted on a basis 
vector 1^ it returned a scalar value but with a more general mapping function 
the returned value should be something other than a scalar. It turns out that when 
you have a function (/? : — )■ A4, the differential of this is a function dcp : TxR^ — >■ 
^¥'(x)A4- The point being that when a function maps one space into another, the 
differential maps tangent vectors to tangent vectors. This is best illustrated though 
a concrete example, which we will give in Sec. 4.3.1, after formulating the Bloch 
vector representation as a Riemannian manifold. 



4.2.4 Stochastic calculus on differential manifolds 

There seems to be a fundamental inconsistency between a smooth, infinitely differ- 
entiable manifold and the nowhere differentiablc path of a Wiener process. From 
the Wong-Zakai theorem, (see Appendix D) we know that if there exists a smooth, 
ordinary differential equation that limits to a stochastic differential equation, then 
the limit should be interpreted as a Stratonovich SDE. When trying to incorporate 
stochastic differential equations into the language of differential forms, one approach 
would be to enforce a smooth approximation, apply the differential technique and 
then take a stochastic limit at the end. However a skeptical mathematician might 
wonder if such a result could be believed as the end result might depend heavily on 
how the smooth approximation was made. 

At a practical level, the second order nature of the Ito rule is difficult to reconcile 
with the notion of constrained motion on submanifold. A simple example of this is 
made in [32], which we will reproduce here. Consider the ordinary differentials, 

dx — dt 

(4.21) 

dy = 2t dt. 

We can easily see that this describes the parabola y — x^, which can also be con- 
sidered an immersion of a one-dimensional manifold into R^. (The parameterizing 
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function is ip{t) = Furthermore we can see that the coefficients of these 

equations, describe a vector, (1,2a;), tangent to the parabola. Were these equations 
used to describe the evolution of a system whose initial condition is on the parabola, 
we expect the system to remain on this submanifold. 

In contrast, consider an equivalent system of ltd stochastic differential equations 



dxt — dwt 

(4.22) 

dyt = 2xtdwt. 

The coefficients still describe a vector, (1,2^^), tangent to the parabola (p{t) — 
{xt, xf). However, a simple application of the Ito rule shows that these SDEs have 
the solution, 

= Xo + Wt 

(4.23) 

Vt^ yo + w1 - 1. 

So even if (a:o,a:o) = (0,0), these equations clearly docs not remain on the parabola, 
even though they are described by a vector field in its tangent space. Conversely, 
the Stratonovich SDEs 

dxt = dwt 

(4.24) 

dyt = 2xt o dwt 
have the solution 

Xt=Xo + Wt 

(4.25) 

yt^ vo + wt, 

which properly describes diffusion on the parabolic manifold. 

Our ultimate goal is to take a system of stochastic differential equations and 
modify their coefficients so that they remain constrained to a particular submanifold. 
This example demonstrates that in order for the tangent space projection to be 
effective, we must first express the Ito equation in a Stratonovich form. 
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4.3 The Bloch Sphere as a Riemannian Manifold 



In order to describe the space of density matrices in geometric terms, we need to 
choose a metric. There are an infinite number we can choose from and it is hkely 
that any results we arrive at will depend upon this choice. In the classical projection 
filtering problem, the metric Brigo et al. choose the the Fisher information, as it 
endows information theory with a nontrivial geometry [31]. van Handel and Mabuchi 
follow this example and use a quantum version of the Fisher information [33] . Later 
authors choose a different metric, namely the trace inner product [34-37] . While the 
trace inner product does not have an immediate connection to quantum information 
theory, it is significantly simpler to work with and, as we will shortly show, under 
this metric the Bloch sphere for a single qubit is Euclidean. In showing this, we will 
also formally construct the state space for a single qubit as a Riemannian manifold. 

For a Hilbert space of dimension d, we will follow [68] and refer to the set of all 
valid density operators as S{d). In the case of a qubit with d — 2, we already know 
that the Bloch sphere is an incredibly useful parametrization of this set. Formally, 
we define this as the map p : i3 C M"^ ^ (S(2) so that 



As every valid quantum state is required to be trace 1 and positive semi-definite, we 

have the constraint that |x| < 1, implying that B is the unit ball. 

Through the Bloch sphere mapping, we can construct a tangents space for S{2). 
This is first done by defining a directional derivative for S{2). Consider the Bloch 
vectors x and y {\x\ , \y\ G (0, 1) ). The derivative of p(x) in the direction of y is 
defined to be 



Then assuming the standard Cartesian coordinate system {x^,x'^,x^}, we have the 



p(x) = i(l+x-o-). 



(4.26) 




p(x + Xy) - p(x) ^ y ■ a 
A 2 



(4.27) 



Chapter 4. Projection Filtering for Qubit Ensembles 



146 



basis of tangent vectors 



_ 1 



(4.28) 



The tangent space at the point p(x) e «S(2) is then 



rp(x)5(2) =span{A : i = l,2,3}. 



(4.29) 



Armed with these tangent vectors we will choose, with some foresight, the trace 
inner product as a metric. For two tangent vectors Di and Dj we have the metric 



While this could result in a complex metric, we can see that for the qubit the basis 
vectors are Hermitian and therefor the metric is real. Also note that due to the cyclic 
property of the trace, it is also symmetric. Then for the qubit, simply calculating 
shows 



Up to a factor of a half, the Bloch sphere is Euclidian under this metric. 

4.3.1 Projecting the unconditional master equation 

In this section we work though an example of explicitly expressing an unconditional 
master equation for a single qubit in terms of a differential form dp. We will also 
do so generally, without assuming a Euchdean metric. Most generally p{t) is a 
map p : — >■ S{2). For any time, t, p{t) returns a vahd density matrix. Then 
as a differential, the master equation is the map dp : TtR^ — > Tp(t)(S(2), which is 
specifically 



(4.30) 



(4.31) 



dp = -i[H, p]dt + V[L]{p)dt, 



(4.32) 
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for a general Hamiltonian H and jump operator L. Instead of the direct mapping 
between time and density matrices, we would like to consider this in terms of the 
Bloch sphere mapping of Eq. (4.26). This can be done if we consider the time 
component as a kind of functional composition, so that p{t) — p{x{t)) for a map 
x{t) between time and Bloch vectors. Prom Eq. (4.26), the general expression for 
dp : T^m^ ^ Tp(,)5(2) is 

dp^ la'{x)aidx\ (4.33) 

To see that this is indeed a map the two tangent spaces we can simply calculate its 
action on the basis vector 




(4.34) 



where there is no sum in the final expression. This is clearly in the tangent space 
Tp(x)iS(2) as it is proportional to Dj = ^aj. Our ultimate goal is then to solve for 
the coefficients a*(x). 

Any traceless matrix 2x2 matrix can be written as a linear combination of Pauli 
matrices. As both the commutator [H, p] and the map V[L]{p) are traceless, both 
of these operations have some expansion coefficient in terms of the Pauli matrices. 
Sec. 4.3 found that the tangent space TpS{2) is also spanned by the Pauh matrices, 
meaning that —i[H, p\ and V[L\{p) are vectors in this space. Thus finding the 
coefficients a*(x) simply comes to projecting these maps onto the basis vectors 0^. 

Sec. 4.2.2 estabhshed that the general projection map Ilj\f can be written as Eq. 
(4.15), in terms of the metric and its inverse g'^^{x). We are able to write dp{t) as 

dp{^{t)) = -tg^^i^) (A, [H,p])^Djdt + g'^{^) (A, V[L]{p))^Djdt. (4.35) 

But as this is a differential with respect to dt and not dx^ we can define the differ- 
entials for the time-dependent coordinates {x^{t)} 

dx^ = -tg'^i^) (A, [H,p])^dt + g^^{^) (A, V[L]{p))^dt, (4.36) 
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meaning that 

dp^DjdxK (4.37) 

4.4 Projections in the tensor product submanifold 

Our ultimate goal is to form a projection from a general state over n qubits, to the 
closest n-fold tensor product of a single qubit state. We will define V to be the 
submanifold of 5(2") which describes the space of all states of the form p(x)®". This 
space has the simple parameterization q : B <Z'M? ^ V d <S(2") such that 

^(x)^p(x)^'^ = l(l + x-<". (4.38) 

We also need to identify the tangent spaces for each point in the submanifold. 
Because of the linear nature of the one qubit map p the directional derivative of ^(x) 
with respect to y is simply 



d 

Dy = —g{x + Xy) 



(4.39) 

A=0 



A derivative acting on a tensor product must obey the Leibnitz rule. The directional 
derivative of p(x)®" in the direction y must then be equal to 



Dy{g{^))^^p{^ + Xyr- 



n 

^p(x)^^-i(8) -2/-o-®p(x)®'^-\ (4.40) 



A=0 i=i 2 



For the single qubit, the directional derivative was uniform over the manifold, 
which implied the Euclidean geometry for our simple metric. For multiple qubits, 
this is no longer the case, which implies that V has a richer geometry. With a slight 
abuse of notation, the basis vector associate with the coordinate x\ evaluated at the 
state p(x)'^" will be notated A(x) and is given by 

n 

A(x) = J]p(x)®^-i ® \a, ® p(x)®"-^'. (4.41) 



Chapter 4. Projection Filtering for Qubit Ensembles 



149 



The tangent space at ^(x) is then 

T,(x)P = span { A(x) : i^x,y,z}. (4.42) 

The metric on V induced from the trace inner product is now easily calculated. 
The product of the two basis vectors Di and Dj is equal to 



p,q=l 
n 

2^ p (g) jCTj Uj ® p 

p=q=l 
n 

+ 2^ p (8) ^CTjp (8) (8) ^paj (g) p 



(4.43) 



9>p=i 

+ 2^ p (8) ^paj ® p ® ^Gip ® p 

q<p=l 



The metric coefficient is then 

" 4 

r 

2 



= ^ Tr(p2)'^-^ Tti a, a, ) + ^^^^ IV(/)'^-2 Tr(p a,) IV(p a,) (4.44) 

^ /-, I l2\"-l r n(n — 1) . , ,2xn-2 k- f r r 

= ^ (1 + 1^1 ) + on (1 + 1^1 ) ^ ^ '^'t^^^^- 



We will often need to calculate the product between several collective operators 
and then take the trace. While Eq. (4.43) has a distinct ordering to the tensor 
products, resulting in the two sums p < q and q < p, upon taking the trace this 
order becomes irrelevant. Thus, there are only two relevant terms: p — q and p ^ q. 

4.4.1 The metric in spherical coordinates 

The metric as given by Eq. (4.44) has a simple form when written in spherical co- 
ordinates. In terms of the Cartesian basis vectors {e^;, e^, e^} the standard spherical 
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basis vectors are defined as, 

= sin 9 cos (pe^ + sin 9 sin (f)ey + cos 9 

ee — cos 9 cos e^; + cos 9 sin (t)ey — sin ^ (4.45) 
= — sin + cos . 
In analogy, we will define the associated tangent vectors, 

Dr{'x) = sin 9 cos D^ix.) + sin ^ sin Dy{x.) + cos 6* -D^(x) 

(x) = cos 9 cos (x) + cos 9 sin (x) - sin 9 (x) (4.46) 
D^{y:) — — sin -Dx(x) + cos -Dj/(x). 

When x is in the subset {xeB: (0<r<l, 0<^<7r, O<0< 27r)}, these vector 
fields form a perfectly valid basis for each tangent space T^(x)^- 

It will also be convenient to define "spherical" Pauli matrices, 
= sin 9 cos cr^; + sin 9 sin cTj, + cos 9 cr^ 

(75t = cos 9 cos (7a; + COS ^ sin (t)ay — sin ^ (T;^ (4.47) 

(J0 = — sin + cos (j) Gy. 

These operators obey the usual properties associated with Pauli matrices, in that for 
i, e {r,6',0} 

Tr(ai) = (4.48a) 

Tr((7i CTj) = 25ij (4.48b) 

[cTj, CTj] = iEijk^ak (4.48c) 

(cTiCTj- + CTjCTi) = dij 21. (4.48d) 

Furthermore, we have that 

A(x) = pW^'"' ® i'^i <^ p(x)®"-^^ (4.49) 
i 

for both Cartesian and spherical bases. And the state p(x) can now be written as 

p(x) = i(l + rcT,). (4.50) 
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We can now use the fact that the Pauh matrices are orthogonal, and the fact that 
the state p is now orthogonal to and o",^ to evaluate the inner product between 
the spherical tangent vectors, and thus write the metric as a matrix in spherical 
coordinates. Prom the general expression 

(A, Dj) = inTr(p2)"-i Tr(a,a,) + |n(n - 1) Tr(p2)"-2 Tr(a,p) Tr(a,p), (4.51) 

we have 



{De, De)=l{l + r^) 



,n-l 



{D„ D,) = ^ (1 + r') 



,n-l 



and 



(4.52) 



{Dr, De) = {Dr, D^) = {De, D^) = 0. 



(4.53) 



As a matrix, the metric in spherical coordinates is given by 



n-1 




1 
1 



\ 



(4.54) 



and its inverse is 



G-i(x) = -(l + r^) 



2\-C"-l) 







1 



v 



1 

Notice that when n = 1 we recover the simple Euclidean metric of gij 



(4.55) 



2 



4.4.2 Calculating collective operator inner products 

This section contains the detailed calculations necessary for projecting the various 
components of the conditional and unconditional master equations onto the space of 
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identical separable states. We will first derive the projection for a general conditional 
master equation of the form 

dptot = -^[-f^tot, Ptot]dt+V[Ltot]{ptot)dt+Ic[hot]{Ptot)dt+H[Lt^t]{Ptot)°dw^ (4.56) 

The subscript tot is used to specify that these operators are operators on the total 
Hilbert space consisting of particles. Any single particle operator A, acting on the 
n*'* particle of the ensemble, is denoted by A'^"'^ and is given by the tensor product, 

The fundamental assumption for this derivation is that the operators i?tot and 
Ltot, act independently and identically on each each qubit and may be written as 

N N 

//tot = Yl = Yl ® ^ ® ^^^"'^ ^^•^^^) 

n=l n=l 

TV TV 

n=l n=l 

Furthermore, the tangent vectors Di, for the single particle state p are 

N 

L). ^^p^^-i^iai^p®^-'^. (4.59) 

n=l 

In projecting the collective master equation onto the identical product states, 
we will need to calculate the product of up to three collective operators and then 
take the trace. Each collective operator is composed of a sum over single particle 
operator, each acting on n*'* member. When taking the product of sums there will 
be N terms where both single particle operators act on the same subsystem, as well 
as N{N — 1) terms where the constituent operators act on different systems. 

The simplest case is when there arc no collective operators i.e., simply calculating 
the overlap between and the state g. This is equal to 

TV 

(A, ^?) = VTr ((p««-i ® ia, ® p^^--) g) 

^ (4.60) 

^NTr{pY-'TT{\a,p). 



Chapter 4. Projection Filtering for Qubit Ensembles 



153 



The next step up in complexity is to include a single collective operator, Atot- This 
requires two sums, the sum from Di and the sum from ^Itot- We then have 

(A, Act ^?) = E ((p^™-' ® ® ^-"^) A^p^^) 

n,m=l 

AT 

Tr ( p ^CTi yip (g) p I 

n=m=l 
N 

+ ^ Tr ( p (g) pylp (8) p ® ^(Jip® p I 

n<m=l 
N" 

+ 2^ Tr ( p^ ®^CFip®p ®pAp®p'' 1 

n>m=l 

= 7VTr(p2)^-iTr(|(7,Ap) + 7V(7V - l)Tr(p2)^-2 Tr(|(7,p) Tr(Ap'). 

(4.61) 

The cyclic property of the trace shows us that upon switching the order of Atot and 
{i.e. to instead calculate (A) Q^tot)) the second term will be left unchanged, so 

(A, Mtot) = 7VIV(p^)^-^IV(|a,pA) + 7V(7V-l)IV(p2)^-2 Tr(|a,p) IV(Ap2). 

(4.62) 

When calculating the projection of the dissipator terms, we need to calculate the 
product of two collective operators, which will have a triple sum. For two collective 
operators Atot and Stot we have, 

N 

{Di,A,,tB,,,g)= J2 Tr((p^"-i®ia,®p®^-'^)A('")i?«p®^). (4.63) 

n,m,l=l 

The previous result shows us that there will be five distinct terms, for cases where 
n — m — l,ny^m = l,n = m^l,n = l^m, and n ^ m ^ I. This expression then 
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simplifies to 

+ N{N -l)Tr{pY~^ Tr(|aip) Tr{pABp) 

+ N{N-l)Tr{pY~^ Tr(|(7i^p) Tr{pBp) 

+ N{N -1)Tt{pY~^ TrC^diBp) Tr(pAp) 

+ Ar(Ar- l)(7V-2)Tr(p2)^-3 Tr(|aip) Tr{pAp) Tr(p5p). 

(4.64) 

The terms Tr(pXp) can be simplified to Tr(Xp^) but were left to make it more 
explicit. The order of the collective operators can be exchanged, but this won't 
effect the five term structure. Thus we calculate that 

(A, AtotgBtot) =NTr{pY-'Tr{la,ApB) 

+ N{N -1)Ti{pY~^ T^ik'^iP) T^ipApB) 
+ N(N - l)Tr(p2)JV-2 Tr(|Mp) Tr{B p') 
+ N{N-l)Tr{pY~^ Trilcj,pB) Tt{Ap^) 
+ Ar(7V-l)(7V-2)Tr(p2)^-3 Tr(|(7,p) Tr(Ap2) Tr(Sp^) 

(4.65) 

and 

(A, ^^tot Act) =NTr{pY-'Tr{la,pAB) 

+ 7V(7V- l)Tr(p2)^-2 Tr(|(7,p) Tr(ASp2) 

+ N(N-l)Tr{pY-' TrilaipA) Tr(5p') 

+ Ar(Ar - 1) Tr(p2)^-^ TiilaipB) Tr(Ap2) 

+ 7V(7V-l)(7V-2)Tr(py-=^ Tr(i(7,p) Tr(Ap2) Tr(Sp2). 

(4.66) 

The final two calculations involving collective operators are the expectation val- 
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ues, (Aot) and (Atot-Btot)- They are 



{Atot)^Tr{Atotg)^NTr{Ap) 



(4.67) 



and 



(AotStot) = Tr(AotStot g) ^NTr{ABp) + N{N - 1) Tr{Ap) Tr{B p). (4.68) 



The general expressions in Eqs. (4.60-4.68) are all we need in order to calculate 

all of the terms in the conditional master equation and a tangent vector Di. Starting 
with the Hamiltonian commutator, Eq. (4.61) and its permutated version gives, 



For the dissipator term, substituting in to Eq. (4.64-4.66) with the appropriate 
collective operator L^qi or lI^^, 



Note that final lines in Eqs. (4.64-4.66) are all equal and hence cancel in the dissipator 
term. 

For the conditioning map ^/[Ltot] we again need Eq. (4.61), with the addition of 
Eq. (4.60) and the single operator expectation value Eq. (4.67). Its overlap then 
reduces to 



(A, [//tot, g])^NTr{pY-'TrC^a,[H, p]). 



(4.69) 



(A, v[Ltot\{g)) = Tr (a (i^tot^i^Iot - k^ltLtotg - IgLltUot) ) 

= ArTr(p2)^-iTr(ia,P[L](p)) 



+ N{N-l)T,{p'f-' Tr(|a,p) Tr(pP[L](p)) 
+ N{N - 1) Tr{pY-' TV(|<7, [L, p\) IV(Ltp2) 

+ N{N - 1) Tr(p2)^-2 Tr(|a. [p, Lt]) Tr(V). 



(4.70) 



(A, n[UM)^Tr{D., (Ltot^ + ^LL-(i^tot + 4ot) ^?)) 



= iVTr(p2)^-iTr(ia,(Lp + pLt)) 

+ N{N - 1) Tr(p2)^-2 Tr((L + Lt) p") Tr(|(7, 
-Ar2Tr(p2)^-iTr((L + Lt)p) Tr(ia,p). 



P) 



(4.71) 
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Finally for the general Ito correction map Eq. (3.86), 

(A, ULM) ^NTr{pY-'Tr{la, {LpL^ + |LV+ ipL^') ) 

+ N{N - 1) Tr(p2)^-2 Tr(ia, p) Tr {p{LpL^ + \L^p + ipLt 2)) 
+ iV(iV - 1) Tr(p2)^-2 Tr(|a,(Lp + pLt) ) Tr((L + Lt)p2) 
- Tr(p^)^-i ( (LtL + IL" + ^L^ t) 

+ i(iV-l) (L + Lt>^)Tr(ia,p) 
-7V(L + Lt> (A, ^[i^totK^)). 

(4.72) 

These expressions simplify, in the case of a Hermitian operator L acting on a 
single qubit state. Specifically when L = J^^ and H is the control Hamiltonian in Eq. 
(4.8), Eqs. (4.69-4.72) simplify to 

(A, [i^tot, q]) = iNTripY-' Tr (a, [r{t)aj, p]) , (4.73) 



(4.74) 



(A, V[J,]{q)) =lNTi{pY-' {^H(^^^zP(r,) - NTiia.p)) 

+ lN{N - 1) Tr(p2)^-2 Tr{pa,pa,) Tr{aip), 

(A, n[J,]{g)) ^iNTripY"' Tr{ai{a,p + pa,)) 

- iN'TripY-'Tiia^p) Tr(a,p) (4.75) 
+ lN{N - 1) Tr{pY-' Tr(ay) Tr(a,p), 

(A,m](^)) = (A, m](^)) 

+ |iV(iV - 1) Tt{pY-^ Tr(a,p2) Tr (a, (a,p + pa,) ) 

- IN\N -l)Tr{pY-'Tr{a,p)'Ma,p) (4.76) 
+ |7V(7V -1){N- 2) IV(p^)^-^ IV(<7,p2)2 IV(a,p) 
-NTr{a,p) {D„ 'H[J,]{q)) . 
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It is worth noting that in Eqs. (4.74-4.76), a majority of the terms are proportional 
to Tr(o'jp). If the state has zero expectation along the ai axis, then the overlap 
with that tangent vector will be greatly simplified. However, any qubit state (which 
is not completely mixed) has a Bloch vector pointing along some axis, leaving the 
orthogonal axes with zero expectation. When the state happens to align with the 
Dj, i.e. the Bloch vector is x = re^, Tr((Tjp) = r. This suggests that these terms 
may simplify for any state, if we choose to work in spherical coordinates. 

4.4.3 The spherical projection of the CME 

With the metric inverse in Eq. (4.55), and the inner product expressions in Eqs. 
(4.75-4.76) we can finally calculate the projection coefficients /i*, c', 6*. To fully 
simplify the inner products we need the following relations, which are easily calcu- 
lated: 



r for i = r 



< for i = ^ 



(4.77a) 



for i — (f) 




(4.77b) 



Tr((7j (7^ pcr^)^ < 



rcos{29) for i — r 
-rsm{20) fori^e 
for i — (f) 



(4.77c) 



Tr{a,p) = Tr{a,p^)=rcos{9), 



(4.77d) 
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and 

Tr{pa,pa,) = |(1 + cos(2e)). (4.77e) 

The azimuthal symmetry of the problem is directly apparent in these expressions, as 
the Jz projection carries no information about 4>. 

Substituting the spherical Pauli matrices into Eq. (4.75), the Hamiltonian inner 
products simplify to 

{Dr, -i[H, g]) = 

{Dg, -i[H, q]) = inTV(p2)-V cos ct>-f{t) sin ct>) 

{D^, -i[H, q]) = ^nTr(/92)"-i r {f{t) sin 9 - f{t) cos 6 cos cj) - f{t) cos 9 sin (/>) . 

(4.78) 

Physically, applying magnetic fields to a spin ensemble cannot change the total mag- 
netization, a fact that is confirmed by having the Dr projection be zero. 

The remaining projections we need to calculate are all based on the mea- 
surement operator and so contain no information about the coordinate. This can 
be verified by substituting the tangent vector into Eqs. (4.74-4.76), which all 
evaluate to zero. 

The Do projections are greatly simplified by the fact that Tr(crg p) — 0. Using 
Eq. (4.77c) we find that 

{Dg, V[Jz]{g)) = -|nTr(p2)"-irsin(2^). (4.79) 
Prom Eq. (4.77b) the conditioning product reduces to 

{Do, n[JM) = -|nTr(p2)-i sin(^). (4.80) 
And combing these two results into Eq. (4.76) we have 

{Do, Ic[Jz]{g)) = - |nTr(p2)"-irsin(2^) 

- \n(n - 1) Tr(p2)^-2 r sin(2e) (4.81) 
+ |n2Tr(p2)"-irsin(2^). 
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Simplifying the r projections is obviously a more complicated task. However, the 
dissipator product with Dr is not particularly more difficult than the Dg product. 
By including the fact that Tr{arp) — r and substituting Eqs. (4.77c and 4.77e) into 
Eq. (4.74) we find 

{Dr, V[J,]{q)) = -lnTr{p^)''-\l + nr^)r sm\e). (4.82) 

Evaluating the conditioning map requires Eqs. (4.77c) and (4.77d) which reduce Eq. 

(4.75) to 

{Dr, = -lnTr{p'r-'{l + nr'){r' - l)cos(^). (4.83) 

Finally when we combining these past results into the Ito correction product, Eq. 

(4.76) simphfies to 

{Dr, IciJ.M) = - lnTr{p'r-\l + nr')rsm'{e) 
+ |n(n - 1) Tr(p2)"-2 r cos^{e) 

- ln\n - 1) Tr(p2)-V3 cos2(^) (4.84) 
+ in(n- l)(n-2)Tr(p2)«-3r2cos2(^) 
+ in2Tr(p2)^-2(i + rir^){r^ - l)rcos'(^). 

Having now simplified the the projections, we are able to include the inverse met- 
ric components in Eq. (4.55) to arrive at the proper projections. The Hamiltonian 
projections are 

/i''(x,t) = 

h'{^,t) = g'' {De, -i[H,Q]) = f{t)r cos - fHt)r sin ^ 

h'^{x,t) = {D^, -i[H, g\) = f{t)r sin 6 - f{t)r cos ^ cos - f{t)r cos 9 sin (j). 

(4.85) 

The dissipator projections are 

f (x) = {Dr, KV[J,]{g)) = -iKrsin^e 

Z^(x) = {De, kV[JM) = -|«rsin2e (4.86) 
Z^(x) = 0. 
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The conditioning projections are 

6'^(x) = (Dr, y/KH[J,]{Q)) = - T^) COS ^ 

6^(x) = {De, V^nUio)) = -v^sin^ (4.87) 
6'^(x) = 0. 
The Ito correction projections are 

c'-(x) = Z'- (L>^, = r(x) + «;Q;(r)rcos2^ 

c^(x) = (D,, «:Xc[J.](i?)) = /'(x) + i«:/3(r)rsin2^ (4.88) 
c'^(x) = 
where we defined the coefficients 

/2 .^ 2(n-l) n(n-l)(l + r2) ^ 2(n-2)(n-l) 2 oon 
(1 + nr^j 2(1 + nH) (1 + (1 + nr^) 

and 

71 — 1 

p{r)=n-2-—. (4.90) 
1 + 

To bring all of this together, we are reminded that the projected conditional 
master equation is 

dpt = {h' + l' -c')Didt + b'DiO dwf (4.91) 

Furthermore a general Stratonovich SDE is traditionally written as dxt = A{xt) dt + 
B(xt) o dwt- To conform to this convention we will define the coefficients 

a''(x, t) = — K, a{r) r cos^ 9 

d%x,t) =h\x,t) - lK/3{r)rsm2e (4.92) 
a'^(x,t) =/t'^(x,t). 

with the Hamiltonian projections h^{^,t) defined in Eq. (4.85). The projected con- 
ditional master equation is finally given by the Stratonovich SDE 

dpt = a*(x, i) A(x) dt + 6^(x) A(x) o dwt (4.93) 
with the spherical tangent vectors defined in Eq. (4.46). 
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4.5 The Projection Filter 

From the projected conditional master equation in Eq. (4.93), we would like to 
find a closed set of easily simulated stochastic differential equations. We now have 
a nonlinear matrix-valued SDE that propagates the closest identical product state 
to the exact conditional state. A single qubit state is completely characterized by 
its Bloch vector, and so to simulate pt we need only find SDEs for the three Bloch 
components. 

In the notation of quantum filtering theory [25] , the filter is a QSDE that prop- 
agates the conditional expectation of a given observable, 

7rtiX)^Tr{ptX). (4.94) 

The filter is generally expressed as a differential, so that for the observable X (on n 
qubits), we have 

dntiX) = TT{dpt X) = a'(x, t) Tr(A(x)X) dt + 6^(x) Tr(A(x)X) o dwf (4.95) 

The Bloch vector components can of course be identified by the expectation 
value of the Pauli operators. For n qubits, wc also have the relation that Tr( Jj g) = 
|Tr((7ip) = so to extract SDEs for the Cartesian Bloch components from 
dpt, we simply need to calculate the filtering equations for the operators 2 Jj/n. Thus, 

dxl = ^ Tt{dpt Ji) = ^ (a"(xt,i) Tr(L',(xt) J,) dt + 6"(xt) Tr(L',(xt) J^) o dwt) 

(4.96) 

for a e {r, 6, 0} and i e {1, 2, 3}. Explicit calculation shows that 
Tr(£)r(x)Ji) = — (sin^ cos 05,1 -|-sin^ sin 05,2 -l-cos^^js) 

Tl 

Tr(D5i (x)Jj) = — (cos 6* cos05ii -|- cos 6* sin 05^2 — sin 6* 5,3) (4.97) 
Tr{D^{yi)Ji) = - ( - sin05u + cos0 ) ■ 
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These equations show how to find a mixed coordinate expression for the projection 
filter, i.e. it expresses dx in terms of the variables r, 6, 0. We would like to combine 
these results with the expressions for a" and b", Eqs. (4.92 and 4.87) to obtain 
deterministic and stochastic coefficients expressed in Cartesian coordinates. We seek 
the functions, a*(x, and 6*(x) such that 

dxi = a'(xt, t) dt + b'{xt) o dwf (4.98) 

With the standard conversion between spherical and Cartesian coordinates, Eqs. 
(4.85, 4.92, 4.87, and 4.97) we can easily find these coefficients. 

In Cartesian coordinates the deterministic Stratonovich coefficients are 
a^(x, t) = fit) x' - fit) x'-Ki air) + /3(r) ) x' 

a'(x, t) = fit) x^ - fit) x''-Ki air) + /3(r) ) x^ (4.99) 
d%x,t) ^fit)x''-fit)x^ + Kf3ir)x^-Kiair) +/3ir))x' 



3 [x-- 



and the stochastic coefficients are 



b'\ 


:x) = 


— \fKX^ X^, 


b'\ 


:x) = 


—\fkx'^ x^. 


b^\ 


:x) = 


v^(l - i^f 



(4.100) 



Note that the functions air) and ^(r) actually only depend upon = 

To complete the derivation we will convert these Stratonovich equations back to 
the Ito form. The Ito correction for the multivariable U coefficients is given by, 

Aa^(x) = a^(x,t) - ~a\^,t) = -6^(x)^^. (4.101) 

Substituting 6*(x) into this formula, the Ito corrections simplify to 

Aa\^)=KX^ ((a^')'-l) 

Aa2(x)=«x2((a;3)2-i) (4.102) 
Aa3(x) =KX^ ((a;3)2-l). 
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By adding these terms to the deterministic coefficients in Eq. (4.92) we find 
a^(x, t) = f{t) - f{t) - \k x^ + «7(r) x^ {x^'f, 

a'(x, t) = f{t) x^ - f\t) x^-\kx'' + k 7(r) {x^f, (4.103) 
a3(x, t) = f \t) x^ - pit) x^ + K (/3(r) - 1) a;^ - K^{r){xy 
were we defined the new coefficient function 7(r) as 

^ _ c.(r)+/?(r) ^ f ^(^ + 1) (4 104) 

and the /3{r) coefficient is given in Eq. (4.90). Note that for any n and < r < 1, 
the coefficient 7(r) is strictly nonnegative. The zeros of this function however are 
two special cases, which we will discuss next. 

If we substitute the stochastic coefficients 6'(x) into Eq. (4.101) the projection 
filtering equations are 

dx] = a^(xj, t) dt — y/nx] x^ dwt 

dx] = a2(xt, t) dt-y/Kx] xl dwt (4.105) 
dx^ = a^(xt, t) dt + \/fi-(l - (xj )^) dwt- 

4.5.1 Special cases for the projection filter 

There exist two very interesting special cases for the projection filter. The first is 
when we only have a single qubit, and the second is when the state is pure. We 
have already shown how, when n — 1, the metric is simply a Euclidean metric. (Up 
to a factor of a half.) Therefore, we expect the projection filtering equations to 
dramatically simplify for a single qubit. The first thing to notice is that when n — 1 
the /3(r) coefficient Eq. (4.90) simplifies to 

/3(r) = 1 forn = 1. (4.106) 

Furthermore, for n—1 the 7(r) coefficient Eq. (4.104) simplifies to 

7(r) = forn = l. (4.107) 
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When we evaluate the projection filter for pure states, we arrive at remarkably similar 
results. In other words for any n = 1,2, . . . we also have 

p{r = 1) = 1 and 7(r = 1) = 0. 

These fantastic simplifications means that for a (possibly mixed) single qubit or 
any pure multi-qubit state, the projection filtering equations are simply 

dx] = (/^(i) x^ - f^{t) x^ - ^Kx]) dt - ^/Kxl x^ dwt 

dx^t = {fit) x\ - f\t) xl - \kxI ) dt - xl dwt (4.108) 

dxl = {fit) xl - fit) x\ ) + (1 - ix\f) dwt. 

It is an interesting exercise to see that if one simply computed the Heisenberg picture 
filtering equations for the Pauli operators, ^ti'^i), one arrives at these vary same 3 
coupled SDEs. 

What the pure state evaluation says is that when r — 1, the dynamics of the 
separable system no-longer depends on the total number of qubits and evolve simply 
as n identical copies of a single qubit state. The only remaining dependence upon 
n is in the innovations process, where we have the differential dwt — > dvt = dyt — 
y/Knx'Idt, where dyt is the integrated measurement current in time [t,t + dt]. 

One finial question about the pure state projection filter is, "Will the filter remain 
pure, once it becomes pure?" In other words, is r = 1 a trap for the system so that if 
Tg = 1 for some time s, will = 1 for all t > s. The physics of the situation dictates 
that it must, as with no sources of decoherence the state will purify and stay pure. 
However, it is difficult to see that this is indeed the case simply by inspecting Eq. 
(4.108). Where this not the case, then it would hkely indicate an error with the 
model. Thankfully the answer is decidedly yes and can be seen in the spherical basis 
representation of dpt, with the Stratonovich coefficients a(x, t) and 6(x, t) given in 
Eqs. (4.92) and (4.87). From these two equations we see that 

d''ix,t) = -Kair)rcos^9 (4.109) 
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and that 

6''(x,t) = -y;^(l -r^)cos^. (4.110) 

Substituting in for r = 1 into Eq. (4.89), we find that a{r = 1) = 0. But this means 
that a''(x, t)|r=i = and 6''(x, t)|r=i = 0. Following this logic to its conclusion we 
find that 

dp\r=i = a^(x, t) \r=i Do dt + 6^(x, t) \r=i De o dwt 

(4.111) 

+ a'^(x, t) \r=l dt + 6'^(X, t) \r=l O dWf 

In other words, by evaluating dp at r = 1 we see that it is independent of the 
tangent vector Dj. and thus will remain on the submanifold of identical separable 
states defined by r = 1. 



4.6 Simulations and Performance 



The primary purpose for deriving the projection filter was to find a reduced dimen- 
sional description of a system of qubits undergoing a continuous measurement of 
the collective angular momentum J^. However we know that the end result of a 
continuous measurement of J^, absent of any other influence, is an eigenstate of J2, 
a so called Dickc states. With the exception of the stretched states, Dicke states 
are not separable. Even after a relatively short time, the reduction of the spin 
component results in a nonclassical spin squeezed state, a phenomena observed in 
several experiments [8-10]. 

In this section we test through numerical simulation how well the projection fil- 
ter reproduces certain properties of the exact conditional state. Here we show that 
by adding strong randomized external control fields during the collective measure- 
ment the joint state remains highly separable and the approximate description of the 
projection filter reproduces collective expectation values much more faithfully. 
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Chap. 5 uses the projection filter in an algorithm to reconstruct the initial con- 
dition of a SCS from a continuous measurement of Jz, characterized by the rate k. 
In order to obtain information about observables other than J^, an external control 
Hamiltonian must be applied. For reasons discussed in Sec. 5.3.1, this takes form 
of a sequence of global 7r/2 rotations, where each rotation is about an axis n that 
was independently sampled from a uniform distribution. Therefore, we will test the 
performance of the projection filter with this control law in mind. 

Fully characterizing the control amplitude f{t) requires specifying the amplitude 
and duration of each pulse, as a larger Larmor frequency is needed to enact the 
same rotation in a shorter time. For simplicity, we will fix f{t) to have a constant 
magnitude and will only vary its direction. This constrains the 7r/2 rotations to be 
square-wave pulses, each of duration r. 



where X[a,b){t) is the indicator function for the interval [a, b) and {rtm} are i.i.d. unit 
vectors drawn from a isotropic distribution. 

To efficiently simulate the exact dynamics we utilize two conserved quantities. 
The first is that because the system Hamiltonian Ht and measurement operator 
L commute with J^, the total angular momentum of the atomic system will be 
conserved. Furthermore, the states we ultimate use arc all initialized in states with 
a maximum projection of angular momentum along some direction, thereby always 
possessing n/2 units of angular momentum. This allows us to restrict the simulations 
to a d — n + 1-dimensional space. In other words we simulate a single J — n/2 spin 
system. 

The second conservation property we will use is the fact that without any addi- 
tional sources of decoherence, the conditional master equation maps pure quantum 
states to pure quantum states. Sec. 3.4.3 discusses the conditional Schrddinger equa- 
tion (CSE) and how it can be derived from a conditional master equation (CME). 




(4.112) 



m=l 
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Using a CSE generates significant computational savings, as each time step prop- 
gates a single a complex vector, rather than a complex matrix. The general form of 
the CSE is given in Eq. (3.94). In our case, L — y/RJ^ for a real, positive k this 
simplifies to 

d\iJt) = ( -iHt - Ik{J, - (J,) f ) liPt) dt + V^{J, - (J,) ) \iJt) dwt. (4.113) 

4.6.1 Simulation parameters 

In absence of the control Hamiltonian Ht, the one universal timescale in the CSE 
is set by the measurement strength, i.e. the characteristic time k"^. Therefore 
these simulations are all reported in time units of this characteristic time. The 
range of qubits total qubit numbers we will test are between 25 — 100, meaning that 
the simulations will be of collective spin values of 12.5 < J < 50. In addition to 
these collective spins, we will compare the projection filtering equations to the exact 
simulations for a single qubit, proving that they generate the same dynamics. 

The remaining parameters, namely the gate duration r and the fixed terminal 
simulation time tf will be chosen to correspond to the parameters that will be ulti- 
mately used in Chap. 5. Specifically, t — 5 x 10~^ and tf — 0.2 k,~^. 

The actual simulations are implemented in the MATLAB computing environment 
using a hand coded, weak second order predictor-corrector stochastic differential 
equation integrator. The algorithm is described by Kloeden et al. [69, page 200] and 
was implemented in MATLAB by Brad Chase for his PhD dissertation [37]. 

4.6.2 Spin squeezing comparisons 

Spin squeezing is a much sought after and well studied effect in atomic spin ensembles. 
The general phenomena describes the reduction in uncertainty in a expected value 
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of a spin component transverse to the mean spin direction. Due to a Heisenberg 
uncertainty relationship, this reduction in uncertainty is accompanied by an increase 
in uncertainty in the orthogonal quadrature. 

The standard example is to consider an collective spin system composed of n 
qubits, initialized in a SCS pointing along the +6^ direction. This state is clearly 
an eigenstate of Jx with eigenvalue nix — J — n/2. It is also easy to show that this 
state is a minimum uncertainty state so that it minimizes the Heisenberg uncertain 
relation 



is a state that is still mostly polarized along the e-r axis but also contains quantum 



A spin squeezed state is one of the immediate consequences of a continuous mea- 
surement of a collective angular momentum variable, such as [71], for a SCS 
prepared transverse to the measurement axis. A number of papers have investigated 
the relation between spin squeezing and various measures of entanglement (see e.g. 
[72] and references therein). One particular measure of spin squeezing, has been 
shown by Yin et al. to be directly related to the concurrence, a measure of pairwise 
entanglement [72]. They show that when the concurrence C is greater then zero, 
indicating entanglement, then <1 and that when > 1, C = and the state is 
unentangled. takes on the following definition. 

For each component of angular momentum we can compose the symmetrize cor- 
relation and covariance matrices (i, j = x, y, z) 




(4.114) 



with equal uncertainties (AJ|) = (AJ^) = \ J- An example of a spin squeezed state 




Corrjj — I {JiJj + JjJi) 



(4.115) 
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and 



Covarjj = Corrjj - ( Jj) (Jj) . 



(4.116) 



Prom these matrices we can also form the Hermitian matrix 



r = (n — 1) Covar + Corr . 



(4.117) 



The squeezing parameter is then defined as 



^ (J2) - f 



2 ^min 



(4.118) 



where Amin is the minimum eigenvalue of the matrix F. 
4.6.3 Squeezing simulations 

This section presents simulations that benchmark the typical effect the measurement 
has upon the states of interest, as well as how much the control law mitigates these 
effects. We test here five classes of states, each composed of n = {1, 25, 50, 75, 100} 
qubits. The one qubit case is included as a control, testing that the numerics produce 
reasonable results. 

As discussed previously, a continuous measurement of is a standard protocol 
for producing a spin squeezed state. However, a spin coherent state (SCS) prepared 
along to the axis will not squeeze at all, while the states prepared in the equatorial 
plane squeeze the most. Therefore to demonstrate the maximum amount of quantum 
correlations a typical measurement realization can produce, a natural choice is a SCS 
prepared along the e^; axis. 

Fig. 4.1 characterizes the typical results of a CSE simulation for a +6^; SCS 
initial state, containing n = 50 qubits, (spin J = 25). Figs. 4.1. a - 4.1.C show the 
conditional expectation values of J^, Jy and as a function of time. Also plotted are 
1(7 regions of confidence indicating the expected deviations from these mean values. 
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Figure 4.1: Typical Uncontrolled Evolution. CSE evolution for a +6^ SCS ini- 
tial state, containing n = 50 qubits, (J = 25). a-c: The conditional expectation 
values for J^, Jy, and vs time, including a la region of confidence, d: The 
squeezing parameter in dB vs time, e: The Husimi Q-function q^{6, (p) for the 
conditional state, \iptf), at the final time tj = 0.2 



In other words, the grey regions are bounded by the values (t/j) ± ^/{KJf). Fig. 
4.1. a shows how (Jj.) tends to decrease as the state squeezes around the sphere and 
how its uncertainty grows. Fig. 4.1.b shows how {Jy) remains zero throughout the 
measurement, while its variance increase due to the characteristic ant i- squeezing. 
Conversely, Fig. 4.1.C shows the decrease in (AJ|) due to the squeezing as well as 
the deviation of (J^) from zero as the system evolves towards an eigenstate of J^. 
Fig. 4.1.d plots the evolution of the squeezing parameter in dB, as a function of 
time. ( Where in decibels is 10 logiQ(^|.). ) Fig. 4.1.e shows a single 3D plot of 
the Husimi Q-function quasi-probability distribution for the state at the final time 
tf. The Q-function for a pure spin state ip with total angular momentum J is defined 



Chapter 4. Projection Filtering for Qubit Ensembles 



171 



as 



o J \ 1 



(4.119) 



where \6,(p) is a SCS parameterized by the polar angles 6 and (p. The constant 
factor ensures normalization. The color scale of Fig. 4.1.e has been normalized to 
the maximum value of ^"^^^ The fact that the Q-function has a maximum value of 
~ 2.2 shows that this squeezed state has poor overlap with SCSs. 
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Figure 4.2: Typical Controlled Evolution. CSE evolution for a +6^; SCS initial 
state, containing n = 50 qubits, with 40 randomized 7r/2 rotations, a-c: The 
conditional expectation values for J^, Jy, and Jz vs time, including a la region 
of confidence, d: The squeezing parameter in dB vs time, e: The Husimi 
Q-function q^{6, (p) for the conditional state, \iptf), at the final time tj = 0.2 k,~^. 



Fig. 4.2 presents a typical realization for the same initial condition in the presence 
of the randomized tt/2 rotations, with a duration of r = 5 x 10^^ k^^. By the time 
tf = 0.2 K~^, this period leads to a total of 40 rotations or one for every horizontal 
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tick mark in Figs. 4. 2. a - 4.2.d. Figs. 4. 2. a - 4.2.c show the conditional expectation 
values (Jx), {Jy) and (J^) as a function of time. In contrast to the example, lacking 
the randomized controls, these Figs, show that there is little qualitative difference 
between the three expectation values. Fig. 4.2.d indicates that there is a significant 
reduction in the amount of squeezing produce during measurement compared to the 
uncontrolled system. ^Tmin — —3.21 dB in the presence of controls while ^Tmin — 
— 10.1 dB without them. The la confidence regions indicate that this squeezing is 
not with respect to a fixed coordinate axis but is rotated between all three and 
so there is a substantial averaging effect that leads to far less squeezing than the 
uncontrolled case. In fact there is nearly a factor of 5 decrease in the maximum 
amount of squeezing and therefore the controlled state is kept much more separable. 
This separability is indicated in the Q-function of the final state, shown in Fig. 4.2.e, 
with its more spherical appearance and near perfect overlap with a spin coherent 
state, indicated by the maximum value ~ 3.7. 

The amount of squeezing is significantly reduced because the randomized controls 
tends to mix both the squeezed and anti-squeezed components leading to a near zero 
average. Not only does the mean spin rotate, but the orientation of the squeezing 
ellipse also rotates. As the rotation axes are chosen from a uniform distribution, the 
squeezed component is just as likely as the anti-squeezed component to be oriented 
along the measurement axis. At any given time, the uncertainty in the Jz component 
is equally likely to be above or below the uncertainty of an equivalent spin coherent 
state. Therefore it is difficult for any significant squeezing to develop. 

4.6.4 Projection filter simulations 

Ultimately we need to compare how well the projection filter performs when it 
calculates an innovation from a measurement record yt that is not generated by 
a separable state. In this case dwt is actually given by the innovation process. 
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dvt = dyt — ^/Knx^ dt, which is not a Wiener process in all cases. We make this 
comparison through two measures. The first is to see how well the projection filter is 
able to reproduce the expectation values {Jx), {Jy) and (J^) compared to the exact 
conditional state. The second is through the fidelity, squared overlap, between the 
exact and approximate states. Fig. 4.3 makes the comparison between the projection 
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Figure 4.3: Projection Filter Tracking example. The comparison between the 
projection filter predictions and the exact conditional expectation values for the 
simulations shown in Figs. 4.1 and 4.2. a-c: The conditional expectation values 
(Jj) are shown for the exact uncontrolled state (blue) with the la regions of 
confidence (grey). Also shown are the projection filtered values |xj (red), d: 
The squeezing in the exact uncontrolled state (blue) and the squeezing reported 
by the projection filter (red), e-h: Same as a-d but with the controls now applied. 



filter and the expectation value shown in Figs. 4.1 and 4.1. Figs. 4. 3. a - 4.3.c and 
4.3.e - 4.3.g re-plot the true conditional expectation values, (Jj), as well as the pro- 
jection filter values, given simply Fig. 4. 3. a shows that as the uncontrolled 
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state becomes significantly squeezed, the (J^) value reduces accordingly. The projec- 
tion filter is unable to account for this and therefore has a noticeable error. Fig. 4.3.C 
shows that after a time t ~ 0.05 k~^, there is an increase in the difference between 
the conditional expectation value (J^) and the value calculated from the projection 
filter. These differences are in stark contrast to the tracking results in Figs. 4.3.e - 
4.3.g where the differences between all three expectation values are almost all within 
the line thicknesses. Figs. 4.3.d and 4.3.h emphasise the fact that the projection 
filter reports separable states and so the projected squeezing parameter remains 
fixed at dB. 

Beyond these two sample trajectories, we also test the quality of the projection 
filter for a variety of initial states and qubit numbers. This comparison is made 
in Fig. 4.4, showing a trial averaged RMS error between the projection filter and 
the exact conditional expectation values. Here we test five different qubit values, 
n e {1,25,50,75,100}. For each n, the average is made over u — 100 input SCSs 
chosen at random with a uniform distribution over the Bloch sphere. These same 
Bloch angles are used for each n. For each input state we run a single simulation to 
compute the three exact conditional expectation values, (Jj), as well as the projection 
filter Bloch components x]. Then for each run we compute the RMS errors 



as a function of time. The expectation value {■)^ represents the athermic mean 
over the u trials. The normalization of the exact expectation value means that 
< Err Ji < 1 or in other words, is in units of the total spin length J. With the 
exception of the single qubit case (showing only numerical integration error), the 
scaled RMS errors are relatively independent of number of qubits. This is likely due 
to the fact that when the system is in a pure state, the projection filtering equations 
are independent of n (see Sec. 4.5.1). 

However, the presence of the strong randomized controls has a significant effect. 




(4.120) 
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Figure 4.4: Average RMS Tracking error vs time. Tlie left column sliows Err Jj, 
with i = x,y, z {in descending order) in the case of no control fields. The right 
column shows the same but in the presence of control fields. The average is over 
u = 100 uniformly random Bloch angles with a single noise realization per state. 
For each Bloch vector, the RMS error is computed for n = 1 (blue), n = 25 
(green), n = 50 (red), n = 75 (cyan) and n = 100 (purple) qubits. 



Sans the controls. Fig. 4.4 shows a near linear increase in Err J^, and Err Jy. In Fig. 
4. 3. a, the projection filter was unable to track the decrease in the ( J^) component as 
the uncontrolled state became squeezed and developed significant curvature on the 
sphere. Were the system initialized in a +ey spin coherent state, the roles of and 
Jy would be reversed but still have the same behavior. We attribute the increase 
in Err and Err Jy to this effect. In contrast, when the randomized controls are 
applied, the RMS error is equally distributed across all expectation values and remain 
< 5% of the total spin length. Additionally, the Err J^ values is significantly worse 
in the uncontrolled case. The ~ 1% error at time t = in the n = 100 simulations 
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is attributed to using Stirling's approximation to calculate the Jz basis coefficients 
in the initial SCS. 
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Figure 4.5: Average Projection Filter Fidelities. These plots show the average 
fidehty between the exact CSE simulation and the SCS given by the projection 
ffiter. The left plot shows the fidelity in the uncontrolled case with the right 
adding the 40 7r/2 gates per simulation. The average is over u = 100 uniformly 
random Bloch angles and a single noise realization per state. For each Bloch 
vector, the average fidelity is computed for n = 1 (blue), n = 25 (green), n = 50 
(red), n = 75 (cyan) and n = 100 (purple) qubits. 



While these results indicate that the projection filter performs well in the pres- 
ence of rapid, randomized rotations, an arbitrary spin state with J > ^ contains 
more information than simply three expectation values. To characterize the general 
performance, we turn to the second comparison and calculate the average fidelity 
between the exact state and a SCS given by the projection filter. Fig. 4.5 makes 
this comparison, averaged over u = 100 uniformly sampled states. This is made 
both with and without controls and again for n = {1,25,50,75,100} qubits. The 
overall state fidelity for SCSs with n > 1 shows a poor performance as the number 
of qubits increases, indicating an increase in the squeezing produced during the fixed 
measurement duration. In the worse case with n = 100 and no controls applied, the 
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average fidelity reaches a minimum value ~ 0.47. However with the controls, the 
fidelity is > 0.80 for any n. The non-monotonic decrease in the controlled fidelity 
suggests that the specifics of the control law impacts this fidelity and so it might be 
possible to optimize the control law so that the average fidelity is maximized. 
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Chapter 5 

Qubit State Reconstruction 



This chapter describes how to use the quantum filtering formahsm to construct a 
tomographic estimate for an unknown initial quantum state from an ensemble of 
identical copies experiencing a joint continuous measurement. We make a maximum 
likelihood estimate of this state, based upon the statistics of a continuous measure- 
ment of an output field quadrature. The purpose of this work is to extend previous 
results [11-13] into a regime where the quantum backaction significantly effects the 
measurement statistics. 

We consider here the case of an ensemble of n qubits coupled to a single traveling 
wave quantum light field. The qubit ensemble is assumed to be in a pure spin coherent 
state characterized by the unknown polar angles {9, 0). The quantum state estima- 
tion problem is mapped to a parameter estimation problem, which is then approxi- 
mated by a Monte Carlo sampling algorithm. Numerical experiments show that the 
ultimate performance of the estimate approaches an optimum fidelity bound, found 
by Massar and Popescu [39]. The deficit in the reconstruction fidelity is attributed 
to a separability approximation in the Monte Carlo algorithm. This algorithm is 
compared to, and significantly out performs, an equivalent "Schrodinger" estimate 
that ignores the backaction of the measurement. At long times the Schrodinger es- 
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timate is shown to be biased away from the true state, indicating the significance of 
the conditional dynamics and the utihty of the quantum filtering framework. 

5.1 Previous reconstruction results 

A fundamental task in quantum information processing is the ability to both reliably 
prepare an arbitrary quantum state and to experimentally verify its production. 
Traditional quantum state estimation relies on an exhaustive tomographic procedure 
where the target state is repeatedly prepared and then destructively measured in 
an informationally complete number of measurement settings. Such a procedure is 
often extremely time intensive, requiring both a tremendous amount of data as well 
as significant post processing time [2, 3]. 

In an alternative protocol proposed by Silberfarb et al., these inefficiencies can 
be largely side-stepped though a weak continuous measurement of an identically 
prepared ensemble in conjunction with a well chosen dynamical control [11]. In 
particular, an atomic ensemble is prepared in an identical tensor product state ptot = 
Po " s-nd experiences a known Hamiltonian while simultaneously coupled to a traveling 
wave probe, via a collective degree of freedom. A continuous measurement of this 
probe then generates a measurement record that is strongly correlated with the 
evolution of the system. If the dynamic drives the system in such a way as to make 
the measurement informationally complete, then a statistical estimate of an unknown 
initial system state should have a high fidelity with the true initial condition. 

Such a system naturally arises in the field of laser cooled atoms, were an ensemble 
of n atoms are easily assembled and then weakly coupled to an off-resonant probe 
laser. One can then measure a collective spin state of the ensemble via the amount 
of polarization rotation induced by the Faraday effect. This protocol has been im- 
plemented in several experiments, ultimately reconstructing the full 16-dimensional 
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hyperfine ground state manifold [12, 13]. However, they were performed in a param- 
eter regime where the intrinsically quantum nature of the continuous measurement 
could be ignored. The amount of state disturbance caused by the nonlinear mea- 
surement process, the so called backaction, was negligibly small when compared to 
the decoherence induced by diffuse light scattering as well as inhomogeneous effects 
in the control fields. 

This work investigates, through theoretical analysis and numerical simulation, 
the fundamental limits of this protocol. We do so in an idealized model, where 
the effects of decoherence are absent and thus the backaction becomes a significant 
effect. To avoid unnecessary complications, we will also reduce the dimensionality 
of our fundamental system and consider only pure qubits, initialized in an identical 
tensor product state |'?/'tot) = iV'o)®"- With a fully quantum model of the atom-light 
interaction, we formulate a maximum likelihood (ML) estimate of the single particle 
initial state, which we will denote as \ipo). 

5.2 The Estimation Procedure 

When ignoring backaction, the linearity of an unconditioned master equation means 
that the measurement signal can be considered as a linear function of the initial 
state of the atomic system, po- In additive white noise model, the instantaneous 
polarimetry signal y{t) can be modeled as, 

y{t) = gTr {V{t, Oq) po) + "white noise" (5.1) 

where g is a measurement gain relating to the signal-to-noise ratio, V{t, •) is the 
Heisenberg picture equivalent to a dissipative master equation and Oq is the initial 
system coupling observable [13]. The problem of state reconstruction in this model 
then becomes a constrained linear estimation problem. 

In a generalized measurement model, the set of possible outcomes is described 
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by a positive operator-valued measure (POVM), with elements {Ea} indexed by a 
discrete outcome a. In a given model of this measurement there exist a (possibly not 
unique) decomposition of a POVM into a set of Kraus operators {Aq,}, which satisfy 
the relation — A]^ A^, for every outcome a. Then upon obtaining the outcome a, 
a pure state j'^) updates via the transformation 



Due to the renormalization factor, this update map is inherently nonlinear in the state 
vector. Any generalized measurement scheme can be decomposed into a continuous 
measurement process [73]. Conversely, a continuous measurement process can be 
modeled as a limiting sequence of weak generalized measurements. Then in general, 
the nonlinearity of a repeated application of a time-dependent update map means 
that a measurement sequence is no longer a linear functional of the initial state po- 

Much is known about the fundamental quantum limits of reconstructing pure 
qubit states from a finite number of measurements. Massar and Popescu showed that 
given n copies of a pure qubit state, it is possible to find a generalized measurement 
that optimizes the average fidelity (T) between of the state estimate and the true 
state, averaged over all possible input states [39]. The fidehty for the pure states V'l 
and ■02 is 



(For mixed states this corresponds to the Uhlmann fidehty, but here we will only 
be concerned with pure states.) With this definition the optimum average fidelity 
bound is simply given 



They also showed that such a generalized measurement is necessarily a joint mea- 
surement involving all n qubits, and no single measurement applied in series to each 




(5.2) 




(5.3) 




(5.4) 
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qubit can achieve this bound. Later, Bagan et al. found that a generahzed mea- 
surement scheme that achieves this bound is a measurement that is uniform over all 
possible spin coherent states (SCS) composed from n qubits [74]. While Varbanov 
and Brun gave a constructive proof for a continuous time stochastic process that re- 
produces a given generalize measurement, it is often quite difficult to obtain a closed 
form expression for what POVM the entirety of a given continuous measurement 
implements. Instead of pursuing this track however, we instead turn to a Monte 
Carlo sampling framework. 

At its most basic level, the initial state estimation problem is a parameter estima- 
tion problem, in that we observe a time varying signal whose statistics parametrically 
depend upon the initial state of the atomic system. The simplest of all initial state 
estimation problems is binary state discrimination. In this problem, the initial con- 
dition is know to be one of two possibilities, -0" or -0^. Then based upon a sequence 
of measurements, {y*}, we wish to identify which state was most likely to generate 
these data. 

In our more general problem, we have a data set {yt} and a detailed model of 
the dynamical system that generated the data, but only with the knowledge that the 
initial state is a SCS. To deal with the continuous nature of this parameter estimation 
problem we resort to Monte Carlo sampling. We randomly generate a collection of 
m sample SCS, {ip^ : j = l,...m}, picked from some prior distribution. In Sec. 
5.4 we describe how we choose the prior distribution though a two step resampling 
procedure, seeded from a uniform distribution over spherical angles. Because the 
space of qubit SCS is isomorphic to the surface of the sphere, with just a few hundred 
samples we can easily cover that space so that any discretization error is well below 
the infidelity implied by the optimum bound (J-')opt- 

Irrespective of how the candidate states are chosen, we have reduced the contin- 
uous parameter estimation problem to a much simpler state discrimination problem. 
We will choose the state ) G {\^"')} that maximizes the likelihood function 
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P{{yt} In other words, the ML state defined as 



1^) = |V> e {IV"^)} : p{{yt} IV) = argmax{p({y,} IV'"^) } . 



(5.5) 



m 



In order to evaluate the hkehhood function, we are still left with the problem of solv- 
ing the recursive POVM expression or finding an equivalent method for calculating 
it. 

Here we choose to formulate an equivalent expression. Because we are working 
with a finite set of hypothesis states, we find that it is more efficient to propagate m 
(approximate) conditional states from their initial values and calculate the likelihood 

for seeing the next increment, given the current estimates. This method is discussed 
in detail in Sec. 5.4. 



Sec. 2.7 reviews how the Faraday interaction can be modeled as a collective angular 
momentum J coupled to a single P quadrature in vacuum. We align our coordinates 
so that wc couple to the Jz projection of angular momentum, with the collective 
angular momentum operators 



where cr- is the i*'* Paufi operator for the j*'* qubit and we have set H — 1. The 
coupling rate k between and P is proportional to the local power in the drive 
laser field, which in general could be a time varying quantity. For simplicity, we will 
assume that the laser is operated in a switched mode, where at time i = it achieves 
a constant value and that the measurement record ends before it is turned off. 

In order to make the measurement record informationally complete, (or in the 
language of filter stability, make the system observable), we need to add an external 



5.3 The Model 




(5.6) 
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control Hamiltonian Ht, acting solely on the collective spin system. The exact form 
for Ht to make it observable will be discussed in Sec. 5.3.1. Under these parameters 
the system field interaction is given by the unitary propagator Ut, which is the 
solution to the QSDE 

dUt = (yK dA\ -^fkJ^ dAt - \k Jl dt - iHt dt^ Ut, Uq = 1. (5.7) 

From this stochastic propagator we arc able to apply the results of Sec. 3.3 and work 
with a conditional master equation (CME). For reference, upon the receipt of the 
measurement realization {yt}t>o, the CME for this model is given by the SDE 

dpt = -i[Ht, pt] dt + nV[J,]{pt) dt + y/Jin[J,]{pt) dvt (5.8) 

with the initial condition po = p(0), where we have the following definitions. V[Jz]{pt) 
is the Lindblad map commonly found in open quantum systems and is defined as 

V[J,]{pt) = J, Pt J. - ^ Pt - \pt Jl (5.9) 

T^[Jz\{pt) is the state update map defined as 

H[J,] (pt) = J,pt + ptJ,-2 IV( J, Pt) Pt. (5.10) 

This map shows how the state updates, weighted by the strength of the innovation 
process, 

dvt = dyt - 2^Tr{J,pt) dt. (5.11) 



5.3.1 Observability and randomized controls. 

In reconstructing the full Cs ground state manifold, Riofn'o et al. used a random- 
ized control policy to generate an informationally complete measurement record [13]. 
Merkel et al. showed that by combining traverse RF magnetic fields and microwave 
radiation, with fixed magnitudes and time varying phases, the 16-dimensional ground 
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state manifold is controllable [75]. In other words, through these fundamental op- 
erations it is possible to generate any ground state operation and thereby map any 
state to any other state. 

The connection between controllability and observability is a natural one. Imag- 
ine that at time t — the probe couples to the operator J^. In order for the 
measurement statistics of this probe to depend upon the Jy Bloch component, an 
external control must at some point rotate the system so that field now couples to 
the part of Hilbert space spanned by the projectors of Jy. If the controls are unable 
to effect some hidden subspace, then the only other way to know about that part 
of Hilbert space is to apply an additional probe. Not every observable system needs 
to be controllable, however. One can certainly observe a system completely without 
being able to affect it in an arbitrary way. 

The strictest definition for a system to be observable is that if there are two 
quantum states and where p^ p^ then there cannot exist a projector P in 
the von Neumann algebra generated by the observation process {5^}i>o such that 
Tr(p^ P) = Tr(p^ P) [76]. (See Sec. 3.2.2 for a discussion of von Neumann algebras 
and quantum stochastic processes.) This definition guarantees that after many trials, 
one will always be able to distinguish p^ from p^ by looking at the statistics of Y. 

However, even if a given system is observable, this does not guarantee that it 
is well observed in a given measurement realization. In order for the statistics of a 
single realization to give a high fidelity estimate, the space of possible initial states, 
e.g. the space of all spin coherent states, should be well represented throughout the 
measurement record. If the goal was to measure Jz to a high degree of accuracy, the 
optimum control policy would be to apply no control at all. However our objective is 
to measure every spin coherent state with equal weight, there by hopefully achieving 
the optimum POVM fidelity bound. 

Riofrio et al. found that high fidelity reconstructions were possible by choosing 
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random, piecewise constant phase angles, thereby randomly cycling though a con- 
trollable set of operations. Here wc choose to implement a control policy that is 
randomized between a set of generators that rapidly spans the space of spin coherent 
states. This policy then guarantees that these states will be well represented in the 
measurement statistics. To achieve this, the control Hamiltonian Ht is chosen to 
have the form 



where the control field components b^{t) are drawn from a random distribution but 
are predetermined before the start of the measurement, i.e. are without measurement 
feedback. 

For simplicity, we further emulate the control policy of the Cs experiments and 
fix the magnitude of the control field while varying its direction in a randomized 
but piecewise constant way. Furthermore we will constrain the magnitude so that 
for each direction, the Bloch vector will rotate by 7r/2. Switching the field direction 
with a period of r then requires = it/{2t). With this constraint, the control 

law is fully defined. 

To generate a control waveform with m randomized 7r/2 gates with a period r, 
we first generate a set of m of unit vectors {cj} so that each vector is drawn from 
a uniform distribution across the unit sphere. The control field is then 



Ht = b{t) ■ J = + by{t)jy + h\t)j, 



(5.12) 



m 




(5.13) 



i=l 



5.4 The Likelihood Function 



In a discrete setting where the space of all possible outcomes, (the entire measurement 
record {yt}t>o) can only have a finite number of outcomes, the likelihood function 
is simply the probability of receiving the observed values, given a parameter value. 
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The maximum likelihood estimate is then the parameter value that maximizes the 
probability for obtaining the observed data. When the measurement takes on a 
continuous number of outcomes the probability for receiving a specific outcome is in 
fact zero. However, we can still formulate a likelihood function by instead considering 
the probability density for the observed value. 

Things become a bit more complicated when considering stochastic processes in 
continuous time. In Chap. 3, we found that the probability measure for a Wiener 
process was defined by Wiener's discrete path integral. This means that for a se- 
quence of n times {0 — to < ■ ■ ■ < ti ■ ■ ■ < tn — t f} we can ask for the probability 
that the Wiener process evaluated at time ti will be within the interval (a^, bi). The 
resulting probability is given by the integral 

P({.„ . ,}) ^ £ d.. £ ...... n exp (- toO!)) . (5.14) 

If one attempts to take a continuous limit of this expression you find something 
rather peculiar [77]. By focusing on just the product of exponentials, one finds 




Were the Wiener process in anyway differentiable, this expression might be exceed- 
ingly useful. However with our limited knowledge of stochastic analysis, it merely 
indicates the subtleties in working with densities of continuous time, nondifferen- 
tiable processes. Attempting to make sense of these kinds of objects lead to the 
formulation of a stochastic calculus of variations, which has proved exceedingly use- 
ful for extending an Ito integral for anticipative integrands [78] as well as a theory 
of white noise stochastic partial differential equations [21]. We will not follow this 
path here. 

There is an additional consideration as we know that {yt}t>Q is decidedly not 
a Winer process. Even making a discrete approximation, we still need to find an 
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expression for the discrete density and how it depends upon the initial system state. 
In this problem, we can make some progress. Sec. 3.4.1 showed that the innovation 
process, Vt — yt — dsTi{JzPs) is an instance of a Wiener process. More 

specifically, Vt is a Wiener process when the filtered state Ps accurately represents 
the conditional state of the system. In our Monte Carlo setting we do not have just 
a single conditional state pt, we in fact have a set of m conditional states {p^}, as 
the proper initial condition is unknown. It is possible that a candidate state p^ will 
differ in some aspects from the conditional state we would calculate, had we know 
then true initial condition. For each hypothetical state we will have a set of possible 
innovations {v^}, each a function of the measurement record yt and the filtered state 
pj". It should be clear that not every v'^ will be an instance of a Wiener process. In 
fact the maximum likelihood estimate that we will construct hinges upon the fact 
that not every will be a Wiener process. This is because rather than computing 
the entire unknown and highly complicated statistics of yt, we will compute the 
statistics of the known and simple statistics of the Wiener process Vt- We then seek 
the candidate initial condition that makes the statistics of v'^ most resemble a Wiener 
process. 

Stepping back from the mathematics for a moment, converting Eq. (5.14) into 
an expression for p{{yt} \p"^) via the innovation is deceptively simple. We can write 
Vt as 




(5.16) 



or in other words, 

rU 

'^Vi = yti - yti_i - dsTr{J^ps). (5.17) 

Jti-i 

Prom the nonanticipative construction of the Ito integral, we have, for the smallest 
of possible time differences. 



Avi^i ^ Ayi - 2^AtiTr{J,pt^_,). 



(5.18) 
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The density for {yt}t>o then made by simply substituting Avi into Eq. (5.14). In 
other words, the hkehhood for the increment variables {y^ = At/i} is then given by 

Pn(yi,y2,---,yn|p"') ~ 

' i\ f " (yi-20^At,Tr(J,pI" J)2\ 
n(27rAt,)-2 j exp f - J] ^-^ ^^AtT^^ ) ' ^^'^^^ 

The only possible way to maximizing Eq. (5.19) with respect to the initial condition 
is by minimizing the argument of the exponential. If we set all of the time 
increments to be equal, Ati — At, we can even factor out the denominator and so 
the maximum likelihood estimate then becomes a problem of minimizing the sum, 

n n 2 

QV«) ^ ^(AO^ = Yl i^y^ - 2^^^ ^(^^ Puj) ■ (5-20) 

1=1 i=l 

This kind of object is called the quadratic variation^ and Appendix B showed that 
it is ultimately what gives rise to the rules of Ito calculus. So while it is a relatively 
delicate mathematical object, it is well defined in the infinitesimal limit. Further- 
more, in proving the Ito rules, one shows that QV(w() = t with probability one, 
so that we expect, and observe numerically, that QV(w™) ~ t for most candidate 
initial conditions. It is often the case that in Guassian problems such as ours, a 
maximum likelihood estimate over a Gaussian probability density simple becomes 
the least squared estimate. So in our Monte Carlo search we have 

argmaxp(yt|p"^) = argminQV(v7*). (5-21) 

m m 

5.4.1 The reconstruction procedure 

The Monte Carlo sampling estimator we have outlined follows this rough procedure: 
1. Sample m pure Bloch vectors uniformly from the unit sphere. 



^Technically the quadratic variation is given in the infinitesimal limit [22]. 
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2. For each sample state compute the forward time evolution conditional on the 
measurement record yt- 

3. Compute the quadratic variations of the innovation processes for each condi- 
tional state. 

4. Select as the estimate, the sample state that minimizes the quadratic variation 
at the final time. 

In practice we need to modify this procedure in two respects. The first is that due 
to involving the stabihty of Markov Filters [76], the above procedure suffers from 
poor numerical stability when the hypothesis initial condition has very little overlap 
with the true initial condition. To rectify this problem, we implement a two step 
procedure, by first sampling m mixed initial conditions and then resampling, within 
some solid angle, pure states about the direction of the most probable mixed state. 
This issue will be discussed in detail in Sec. 5.4.2. 

The second modification stems from the fact that propagating the full conditional 
Schrodinger equation for a sufficient number of samples requires a large amount of 
computer time. To fully propagate a spin J pure state requires 2J + 1 complex 
numbers. The stochastic integrator we choose to use implements a weak second- 
order predictor-corrector method ( Kloeden et al. [69, page 200] ) and empirically 
requires a time step At ~ 10^^ to produce reliable expectation values. When 
considering ensembles of mixed qubits it is not sufficient to consider the maximum 
projection of the collective angular momentum, but instead requires considering all 
possible total angular momentum values one could construct with n spin-| particles. 
This requires a total density matrix of order x in size [79] . 

In Chap. 4 we developed a projection filter that projected the conditional master 
equation for the collection of n qubits onto the manifold of identical separable states, 
which greatly reduces the computation demand. We also showed that in the presence 
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of strong randomized control, the projection filter tends to track the exact expecta- 
tion values with a RMS error of less than 5% of the total spin length. For mixed 
initial conditions, rather than propagating matrices of dimension ~ x for each 
sample state, the projection filter allows us to reduce this to tracking a single mixed 
Bloch vector, i.e. three real numbers. With these modifications, the Monte Carlo 
separable least squares estimate is computed though the pseudocode algorithm 5.1. 

Algorithm 5.1 A Monte Carlo Separable Least Squares Estimate 
{ Tm } < — m uniformly random Bloch vectors with r = Tmixed < 1 
for all r„ e { } do 

i — Integrate Eq. (4.105) with record yt and initial value = r^. 
QV(^;-) ^ 5^. (Ay, - v^n xf- At, f 
end for 

rmin < fm' e { I'm } : QV(vJ"' ^) = mlu { QV(f ™) } 

{ r'^} i — m random Bloch vectors with r' = 1 and r'^ • rmin /rmixed < cos(0max) 
for all r;„ e {r;„} do 

r'^ i — Integrate Eq. (4.108) with record yt and initial value rg^ = i"^- 

qW{vT) ^ i^Vi - v^n X?- Ati f 
end for 

rU ^ C e { r;„ } : qY{vt^'^) = min { QV(t;r) } 
return p{y'^,^) 



5.4.2 Coupled CMEs 

and filter stability 

For a given hypothesis state p"^, there is a hidden "true" state p* that generated 
the measurement record yt- When using this measurement record to propagate the 
hypothesis state, that ends up couphng to p*. This leads to a coupled set of 
stochastic differential equations. By definition, the conditional state results in an 
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innovations process that is a true Wiener process. Explicitly, is evolves according 
to the stochastic differential equations, 

dpt = - i[Ht,p*] dt + kV[J,]{p*) dt + ^/k'H[J^]{p\) dwt, (5.22a) 
dyt = dwt + 2^TY{J^pl)dt. (5.22b) 

where Wt is the correct innovation process and is an unobserved Wiener process. 
The candidate initial condition p^, and the measurement record yt are the elements 
necessary for propagating p^ in time via, 

dpT = - i[Ht, pT] dt + kV[J,]{pJ') dt + 0^?/[J,](p™) dvT, (5.23a) 
dv'l' = dyt - Tr((L + L^)pT)dt. (5.23b) 

In terms of the unobserved Wiener process Wt, evolves as, 

= - ^[Ht, pT] dt + kV[J,] (p™) d.t + V^n[J,] (pD dwt (5.24a) 
-2Kn[J,]{pT)Tr{UpT-pt))dt. 

Note that if at some time t we happen to have the equality p^ = pj", then we also 
have dp* = dp^. But the general state Pr at some time t > t can be written as 
Pt = Pt + ft dpt'. This implies that because the differentials are equal whenever the 
states are equal we will have p* = p^ for every t > til p^ — p^. 

One possible result of this coupling is that it acts as an attractor, always decreas- 
ing the "distance" between the Jz projections of p^ and p™. This correction effect 
is known as filter stability. If the filter is able to correct for certain modeling errors, 
it is stable. The differences in the two initial states Pq and p^ can be viewed as a 
modehng error and the convergence of p^ — > p^ is a correction of this error. This is a 
well studied effect in both the quantum and classical settings, see [76] and references 
there in. 

In [76], Van Handel gave explicit criteria for when a quantum filter is stable for 
an incorrect initial conditions. For our purposes these criteria boiled down to the 
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following two issues. The first is that the system must be observable, in that the 
measurement record must be informationally complete. If we did not have a trans- 
verse magnetic field, then the measurement statistics would only include information 
about the eigenstates of and so the system is not observable. The second issue is 
that the probability density for yt calculated with the true state p* must be absolutely 
continuous with respect to the density calculated under the guessed state p"^. This 
is a term borrowed from classical probability theory and embodies the concept that 
a probability measure is compatible with observations that are actually governed 
by P"^. The quantum version is given by the following definition. In order for p"^ 
to be absolutely continuous with respect to p^, then for any projector P in the von 
Neumann algebra generated by {5^t}j>o, we must have Tr(P p^) — implying that 
Tr(P p^) — 0. This need not be a two sided relationship so that p^ need not be abso- 
lutely continuous with respect to p'^. These requirements are not just important to 
the question of filter stability but also apply to the Monte Carlo sampling procedure. 
As it has been discussed previously, the observability condition is vital in order to 
obtain a high fidelity estimate. However absolute continuity is also quite important. 

In a Kraus operator formulation of a continuous measurement, the state after the 
measurement outcome i is updated as 

If the denominator Tr{AlAip) — 0, then the update cannot be made as it requires 
dividing by 0. However 'Tr{AlAip) is also the probability for obtaining the outcome 
i, as calculated according to the state p. Therefore, if the event {i} occurs with this 
probability, dividing by zero is not an issue as it will never happen. Suppose we obtain 
the outcome i that occurred with probability TrlAjAip*) = p*. Furthermore, suppose 
we tried to update a state p"* that had the audacity to assert pf^ — TY{AlAip"^) — 0. 
This results in a crisis of conscious, as there is no way to incorporate this incompatible 
information into our world view. The condition that p* must be absolutely continuous 
with respect to p"* means that pf^ will never be zero without p* also equal to zero. 
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In principle any valid initial spin state could generate a given diffusive measure- 
ment record. This can be easily seen by noting that the "true" innovations process 
is given by 



^0 

and is a Brownian motion. Because Jz takes on eigenvalues in the range —n/2 < 
rriz < n/2, any candidate innovation will be within the range, 



For finite, n, k, and t, it is perfectly possible for a Brownian motion to obtain any 
of these values, it is just increasingly unlikely. Therefore if the measurement record 
is observable, the conditional master equation is in principle stable. 

In practice, the numerical stability of states conditioned on highly improbable 
measurements becomes a issue. By not taking this into account preliminary results 
that did not consider the possibilities of unstable trajectories, showed nearly a 1% 
drop in the average reconstruction fidelity from what we ultimately achieve. Inves- 
tigating the cause of this sub-optimal performance showed that the average fidelity 
was significantly biased by outlier trajectories that gave estimated states that were 
nearly orthogonal to the true state. The cause of these outliers was the numerical 
stability of Monte Carlo sample points with very poor overlap with the true state. 

By switching to the two step sampling procedure in algorithm 5.1, every initial 
mixed single qubit state can be viewed as a convex combination of pure states point- 
ing along opposite directions. That is, if we have the possibly mixed single qubit 
Bloch vector r with length < r < 1 we have 



where p(er) is a projector on the the pure SCS pointing in the Br direction. This 
implies that by using initial mixed vectors each initial state has some support over 




(5.26) 




(5.27) 




(5.28) 
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the orthogonal spin coherent state ijj{e^r)'^"'. In the numerical simulations presented 
in Sec. 5.5 an initial mixed state vector of radius Tmixed = | provides enough of 
a signal to choose an appropriate direction for the pure state resample as well as 
enough orthogonal support for the trajectories remain stable. 

5.4.3 Backaction in continuous quantum measurement 

In order to identify what impact the backaction has on the reconstruction fidelity, we 
need to construct a similar but backaction-free estimator. A figure of merit commonly 
used to consider the importance of backaction is the ratio of the "projection noise" to 
the "shot-noise". The projection noise is a description of the fluctuations {i.e. noise) 
in a given observable if a projective measurement is made. As we are considering a 
continuous measurement of with respect to a SCS, the relevant projection noise 
is 

{^J')^.n = 1 <A<7,2>^ = np^^{l - p^,) (5.29) 

where p+i = |(-|-1|'^)|^ is the probability to observe the individual spin state to be 
in the +1 eigenstate of [4]. 

The shot-noise describes the noise added by making a continuous measurement 
over a finite time. To identify the order of magnitude of this additive noise, note 
that from Eq. (5.26) we have 

yt = V* + 2V^ / ds Tr(p: J,) (5.30) 
Jo 

and that is a realization of Brownian motion. We would like to invert this formula 
to arrive at a random variable whose statistics allow for an estimate of Tr(pQ J^). 
Suppose that we wished to model the system completely ignoring the theory of 
continuous quantum measurement and that for times < s < t, p* evolves according 
to the Schrodinger equation. If we further assume that Hf = 0, we then have that 
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Tr(p* Jz) = Tr(po Jz) and so the classical random variable 



Tr(J.pS) + 



1 



(5.31) 



h = 



2y/Kt 



is Gaussian distributed with mean Tr{JzPQ) and Var(j2;) = (Ant)'^. It is this vari- 
ance that is referred to as the shot-noise added by the probe. Looking at the ratio 

of these two fluctuations we have 



then expect a maximum amount of spin squeezing or equivalently a large amount of 
bipartite entanglement. In this case (AJ^)^^,^ takes on its maximum value of n/4 
and so ( — n Kt. When C ^ 1 then one expects a significant contribution of quantum 
backaction in the system and therefore the measurement effects must be accounted 
for [5]. In the uncontrolled spin squeezing simulations of Sec. 4.6.3, we found that 
for n — 100 and nt — 0.2 we found ~ 10 dB and so that in the absence of strong 
Hamiltonian controls, C = 20 indeed leads to a strongly nonclassical state. 

However, the above discussion assumed no controls. It is possible that with 
the randomized controls considering only the Hamiltonian evolution is sufficient to 
obtain a high fidelity estimate. To make this comparison we formulate a backaction- 

free estimator, one that only includes the Hamiltonian in the model for the forward 
time dynamics. Rather than considering a measurement record where yt is given by 
Eq. (5.30), we instead propose a model were 




(5.32) 



If the system is prepared in a SCS with p+i 



I then Sec. 4.6.3 showed that we 




(5.33) 



and p*{t) is the solution to the Schrodinger equation 



dt 



(5.34) 



and Wt is a Wiener process. 
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To make a fair comparison, this backaction-free estimator will also be imple- 
mented though a Monte Carlo sampling procedure. We use a algorithm similar to 
algorithm 5.1, but with two modifications. The first is that the two step sampling 
procedure is unnecessary because there are no conditional dynamics to cause numer- 
ical stability. The second is that because the dynamics are linear, the Schrodinger 
evolution in Eq. (5.34) is most efficiently computed in the Heisenberg picture. In 
the Heisenberg picture, we simply need to integrate the time evolution of the 
observable once and then compute its expectation value with each candidate state. 
Furthermore in this decoherence free model the system state will always remain in a 
separable state and so we need only consider the Heisenberg evolution for the single 
qubit Pauli operator, a^- In other words. 




(5.35) 



where az{t) is the solution to the Heisenberg equation of motion 



di 



(^z{t) = +i[Ht, CTzit)], with 0-^(0) = a^. 



(5.36) 



The pseudocode for the backaction-free estimator is given in algorithm 5.2 



Algorithm 5.2 A Monte Carlo Backaction-free Estimate 



Wz{ti)} < — Integrate Eq. (5.36) and evaluate at times {ti}. 
{ I'm } < — uniformly random Bloch vectors with r — 1 
for all r^ e { r^ } do 



QV«) ^ i^Vi -V^ri Tr(a,(t,_i) p{rj ) At, f 



end for 



r, 



min 



r^'e{r^}: QV(^^')) 



min { QV«) } 



return f>(r,,,i,, ) 
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5.5 Numeric Simulations 

This section presents the results of numerical simulations, comparing algorithms 
5.1 and 5.2 to the optimum POVM bound in Eq. (5.4). The bound (J") opt = 
{n + l)/{n + 2) gives the average fidehty of a single POVM where the average is 
taken over measurement outcomes as well as an average over possible input SCS. 
Therefore, the results of these simulations are reported as an average of ensemble of 
u trials. All results in this section use u = 1000 trials. 

5.5.1 Simulation parameters 

For each trial, we choose a single qubit Bloch vector from a distribution that is 
uniform over the surface of the unit sphere. We then use this vector to gener- 
ate SCSs composed of n qubits. This simulations use the qubit numbers n — 
25,40,55,70,85, and 100. Then for each initial state and each number of qubits, 
we generate a single measurement realization yt and use this record to then estimate 
the initial Bloch vector. In total 6000 measurement records were generated. 

Every simulation uses the same control Hamiltonian, where the randomized piece- 
wise constant control vector b{t) was generated at the start of the simulation. The 
directions of rotation arc again distributed uniformly across the unit sphere and no 
attempt was made to select an optimum realization. The parameters that fully con- 
strains the simulation are the measurement strength k, the final measurement time 
tf and the control gate period r. With no other scales in the problem we choose to 
essentially set k to one and discuss the remaining two parameters in units of k~^. 

In Chap. 4 we found that the separable approximation is valid in regime where 
the randomizing magnetic field strength k <^ Bq. By fixing the strength to generate 
a 7r/2 rotation in one gate period r this means that bo — 7r/(2T), implying that 
K,T <^ 1. We also found that a gate period r — 5 x 10~^k,''^ gave less than a 5% 
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RMS tracking error for the separable projection filter, (see Sec. 4.6), and places bo 
two orders of magnitude greater than k. 

For n = 25 — 100 qubits we find that the reconstruction fidelities have saturated 
by a time i ~ = 0.2 k"^, which we fix as final time for every simulation run. With 
this final time and gate period, each simulation has 40 randomized 7r/2 rotations. 

To efficiently implement these simulations we exploit two conservation properties 
of the system. The first is that because the total angular momentum operator 
commutes with the stochastic unitary of Eq. (5.7), the total angular momentum of 
the system is conserved. This means that by initializing the system in a state of 
maximum projection of angular momentum {i.e. in a pure SCS) we are initializing 
the system in the eigenspace with total angular momentum J = n/2. Rather than 
considering the entire d = 2" dimensional Hilbert space we only need to simulate 
a spin J = n/2 particle and work in its = 2 J + 1 = n + 1 dimensional Hilbert 
space. The second property is that the conditional master equation we consider here 
maps pure states to other pure states, because it has no additional loss channel. This 
means that we can in fact integrate a conditional Schrodinger equation rather than 
a conditional master equation. This makes a substantial savings in computational 
overhead as we be propagating a. d = n + 1 complex vector in time, rather than a 
d X d complex matrix. These two properties that makes it computationally feasible 
to generate 1000 measurement records for a system containing 100 qubits. 

The actual simulations are implemented in the MATLAB computing environ- 
ment using a hand coded weak second-order predictor-corrector stochastic differen- 
tial equation integrator. The algorithm is described in Kloeden et al. [69, page 200] 
and was implemented in MATLAB by Brad Chase for his PhD dissertation [37]. 
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Monte Carlo Parameters 

The Monte Carlo separable estimator used 250 sample states for each part of the 
two-step estimation. In the initial step, the 250 mixed states produce a sparse but 
uniform covering of all possible SCS directions. A typical sampling has an average 
angular separation between adjacent points of ~ 6° and a maximum separation of 
~ 20°. As mentioned above, the mixed state radius of the Bloch vector used in these 
simulations is rmixcd = 0.75. In the second step, we sample 250 pure states that 
are constrained to be no more than 45° from the most likely mixed state direction. 
Example first and second step sampling distributions are shown in Fig. 5.1. 

Mixed State Distribution Resampled Distribution 




Figure 5.1: An Example Monte Carlo State Sampling Distribution, (left) An 
example of the initial mixed state sampling for the Monte Carlo algorithm, with 
m = 250 and rmixed = 0.75. (right) An example of resampling m = 250 states 
about the -t-e^; axis with 0max = 45°. 



For the backaction-free comparison, we use an number of samples matching the 
density of points in the second resample step. The resampled solid angle covers 
approximately 15% of the Bloch sphere, meaning that m = 1700 cover the whole 
sphere with roughly the same density of states. This number of samples lead to an 
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average fidelity between nearest neighbors of (J-')sampic = -9994, meaning that if the 
true ML estimate falls between two sample points, on average, the infidelity caused 
by the Monte Carlo sampling will be on the order of 10~^. This is well below the 
optimum POVM bound for the simulated qubit number and so any loss in fidelity 
should not be attributable to sampling errors. 



5.5.2 Results and discussions 
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Figure 5.2: A comparison of numerical reconstructions to the optimum bound 
(Color Onhne.) Data points show the average fidelity of single shot reconstruc- 
tion as a function of the number of qubits n, averaged over i/ = 1000 randomly 
chosen pure initial states states. Blue circles show the separable estimator. Green 
diamonds show the backaction-free Schrodinger equation estimator. The opti- 
mum POVM bound is show in as a dotted line. Error bars show a standard error 
of±^Vai[T]/iy. 



Fig. 5.2 shows the results of the numerical simulations. The trial-averaged re- 
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construction fidelity is plotted as a function of the number of qubits in the system 
for both the separable estimate {i.e. with backaction) and the Schrodinger evolved, 
backaction-free estimator. The fidelity is computed by taking the squared overlap 
between the single qubit state for that measurement record with the single qubit 
state estimate. In other words, if the true qubit state is given by the Bloch vector ro 
and the estimate reports the Bloch vector then the fidelity of that reconstruction 
is given by J" = |(1 + tq • ). 

In these numerical experiments, the separable Monte Carlo estimator shows a 
significant improvement over a simple backaction-free estimator that considers only 
the unitary evolution of the state due to the control fields. The discrepancy increases 
as the number of qubits increase, keeping the duration of the measurement fixed. 

Furthermore, the separable estimator almost achieves the optimum bound. The 
deficit between the bound and the numerical averages never exceeds 0.21% with an 
average of 0.16%, which is still above the expected error caused by the discrete Monte 
Carlo sampling. A possible source for this deficit could be the separability assumption 
in the projection filtering method, which is known to have a non-negligible tracking 
error in the Jz expectation value (see Sec. 4.6). 

The performance of the backaction-free Schrodinger estimator is best understood 
by considering not just the estimate for the initial state given the entire measurement 
record, but to instead consider the family of estimates created by only taking part 
of the measurement record. 

Estimator Bias 

The Monte Carlo estimators take as input a measurement record y containing data 
for times t e [0, tf] and returns an estimate for the initial state pq. It is just as easy to 
consider a whole family estimates computed with only part of the total measurement 
record, i.e., instead of using the entirety of y we use i/^ for < s < in computing 
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the estimate. Ideally, having more data should only improve the estimate. However, 
in order to use the data at times t > s we are required to compute an estimate for 
the state of the system at time s. If this estimate is in fact inaccurate, then any 
modeling errors might bias the conclusions drawn from future measurements. 



Moreover both of the estimators considered here have modeling errors. The sep- 
arable estimator uses the projection filtering equations, which explicitly remove any 
entangling dynamics. The estimator based simply upon the unitary Schrodinger dy- 
namics makes a much greater sin. This estimate completely ignores any effect the 
measurement has on the system of qubits. Figures 5.3 and 5.4 indicate what affect 
these modeling errors have on the average reconstruction fidelity. 
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Figure 5.3: Reconstruction fidelity vs measurement duration. This plot shows 
the average reconstruction fidelities for the separable filter estimate, as a function 
of the length of the measurement record. Shown are traces for the 6 qubit 
numbers considered, which are (in order of decreasing reconstruction fidelity) 
100, 85, 70, 55, 40, and 25 qubits respectively. The vertical axis is a linear scale, 
with grid lines indicating the optimum fidelity bound for these same number of 
qubits. The averaging was over u = 1000 randomly chosen pure initial states. 
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Fig. 5.3 shows for the separable filter, the trial averaged reconstruction fidelities 
for all 6 qubit numbers plotted against the duration of the measurement record. 
It is clear from this figure that having a larger signal composed of more qubits 
improves the final fidelity. It also shows how, as the number of qubits increases, the 
fidelity improves at a faster rate. Furthermore, the modeling error introduced by the 
separable approximation does not seem to significantly bias the estimate away from 
an optimum sample state. 
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Figure 5.4: Reconstruction fidelity vs measurement duration. This plot shows 
the average reconstruction fidelities for the backaction-free Schrodinger estimate, 
as a function of the length of the measurement record. Shown are traces for the 
6 qubit numbers considered, which are (in order of decreasing reconstruction 
fidelity) 100, 85, 70, 55, 40, and 25 qubits respectively. The vertical axis is a 
linear scale, with grid lines indicating the optimum fidelity bound for these same 
number of qubits. The averaging was over u = 1000 randomly chosen pure initial 
states. 




Fig. 5.4 shows an identical plot for the backaction-free Schrodinger estimate, 
showing that a larger number of qubits improves the reconstruction fidelity with a 
higher fidelity estimate at shorter measurement times. However, it also shows that 
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not including the backaction into the model significantly decreases the reconstruction 
fidelity at longer measurement times. This bias tends to be more pronounced as 
number of qubits increases. 

While the peak reconstruction fidehties between the two methods seem to be 
comparable (certainly within error bars), the fact that the Schrodinger evolution is 
biased away from an optimum state shows the importance of including the backaction 
in the dynamical model. 
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Chapter 6 

Summary and Outlook 



We conclude with a summary of each research chapter and a discussion of the possible 
avenues this research might take in the future. 

6.1 Quantum optics and quantum stochastic dif- 
ferential equations 

Chap. 2 derived a relation between quasi-monochromatic traveling wave packets 
and the bosonic Fock space necessary for defining a formal quantum Ito stochastic 
calculus. We identified a limit where the continuous-time tensor product decom- 
position is consistent with a quasi-monochromatic approximation. The limit was 
ultimately enforced by convolving any bounded, square-integrable complex function 
with a smoothing kernel, constraining the resulting object to be slowly-varying in 
time. A suitable white noise limit was identified when the kernel approached a 
delta function in conjunction with a limit where its derivative remained remained 
infinitesimal when compared to an optical period. This produced a separation of 
three timescales. The resulting quantum stochastic integral is on the slowest scale, 
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the delta correlated smoothing kernel is in the middle, and the fastest is an optical 
period. 

This version of a quantum white noise limit is new and distinct from two existing 
explanations. The first is a static picture of a bosonic heat bath lacking any dynam- 
ical flow of information, which does not capture the fundamental propagation of a 
traveling wave field [50, 51]. In an alternative description, Accardi et al. derive a 
quantum white noise limit through a rescaling argument [54, 55]. If the system field 
coupling Hamiltonian had a fundamental interaction strength of A, then by rescal- 
ing time as t ^ t/X^ and taking the limit A — > 0, a quantum white noise operator 
appears. While this is in no doubt mathematically correct, it is in our opinion ad 
hoc, as requiring time to be not just big, but specifically 1/A^-big is an artificial 
constraint. Here we do require a white noise limit in a "middle timescale", meaning 
that the smoothing envelop approaches a delta function {a 0) while the timescale 
of one carrier oscillation tends to zero faster {{aujo)^^ — > 0). We do not require any 
fixed or dehcate scahng law. Additionally, in our model the QSDE treatment stands 
independently from any system-field interaction, as long as that coupling respects 
the above approximations. 

After formulating this a quantum white noise approximation, we rederived a 
QSDE description of the propagator. This derivation applies the recently derived 
quantum Wong-Zakai theorem [43]. This result describes how a general interaction 
involving scattering operations converges to a valid QSDE. This result allows for 
a stochastic description of a dispersive Faraday interaction in a regime of a weak 
drive but high optical density. A fundamental characteristic of any scattering based 
propagator is that admits for the possibility of having multiple scattering events in 
the intermediate timescale. It effectively renormahzes over this effect. 

The scattering propagator we derive is similar to a propagator derived by Bouten 
and Silberfarb [80] . They derive a polarizabihty interaction for a 4-level atom starting 
from a QSDE expression for a quantum field coupling two ground states, via two ex- 
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cited states. They then adiabatically ehminated the excited atomic states under the 
usual approximations of weak excitation. The resulting propagator contained similar 
but not identical scattering processes. The difference is that the resulting integrands 
for the dA^ ^^'^ dA^ terms only contain single scattering events. By starting from 
a QSDE for a dipole interaction and then eliminated an atomic manifold, the atoms 
are only capable of making a single scattering transition in one intermediate time 
increment. It is unclear at this point which model is more applicable for describing 
the underlying physics or if both are equally "wrong", just in different ways. It 
should be noted that both models agree in the limit of a weak forward scattering 
rate and negligible spontaneous emission. More analysis is clearly needed to settle 
any debate. 



6.2 Classical and quantum probability theory 

Chap. 3 served as a review of both classical and quantum probability theory. It 
ultimately focused on the mapping, via the spectral theorem, between sets of com- 
muting observables and a classical probability space. When a family of operators 
pairwise commute, the underlying projectors, i.e the spectral measure P{dX), 
define the classical probability model (O, J^, P). The sample space Q is set of labels 
{A}, J-" is the smallest cr-algebra over Q, and the probability measure is defined by 
the quantum expectation value for a projector associated with any element in J-", 
P(ciA) = Tr(pP(ciA)). 

The utility of this mapping is that it allows us to define a classical stochastic 
process that is in some sense equivalent to a QND observable. When restricting 
our attention to operators that commute with the projectors defining the classical 
space, we are able to treat any operator A — J^ a(A) P{dX) in terms of the classical 
random variable a. When the eigenvectors of A do not form a complete basis in 
the underlying Hilbert space, there exist operators X that commutes with A but 
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still must be treated quantum mechanically. Rather than being a liability, this is 
in fact a feature, as it allows us to simplify a system-probe interaction by treating 
the system quantum mechanically and the probe as a classical random variable. 
We call the resulting description semiclassical, but it still captures the physics of a 
system-probe-measurement model, when {Yi}-^j are the measured probe observables. 
However a careful physicist should always check that any operator X considered in 
this model commutes with every Y^. 

We then apply this formalism to rederive the conditional master equation, when 
measurements are generated from the observation process {Yt — Uf{At+Al)Ut}t>o in 
vacuum expectation. This derivation is initially performed in the Heisenberg picture 
and is then referred to a quantum filter. In the quantum filtering language the filtered 
observables UlXUt are still operators, written as TTtiX), and are linear combinations 
of the projectors {P(ciA)}. When converted to a conditional master equation, we 
compute a system state pt that is defined on the system Hilbert space only. When 
taking an expectation value of a system operator X, we have the equality 



for every system operator X and time t. We also described how the quantum filter 
is equivalent to a purification of any generalized measurement scheme. 

When the conditional master equation has complete information, and the ini- 
tial condition is a pure state, then it is sufficient to use an equivalent Schrodinger 
equation and calculate the random system state vector \ipt)- Given a correct and 
complete description of the system, a conditional Schrodinger equation is identi- 
cal to the stochastic Schrodinger equation derived in the quantum optics literature 
and the choice of the measurement observables defines the specific unraveling of the 
unconditioned master equation. 

In a recent paper by Tsang and Caves, they define a quantum mechanics free 
subsystem as a set of time-dependent operators subject to the constraint that the 



(6.1) 
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operators commute for any time [81]. Given the set of Heisenberg picture operators 
{Oi{t) : i = 1, . . . , n}, if [Oi{t), Oj(t')] = for all t, and t', then they are free of 
the laws of quantum mechanics. This is exactly the same idea as the quantum to 
classical mapping via the spectral theorem and relies on exactly the same principle. 
We mention this result here as it is an example showing active research utilizing the 
mapping between quantum and classical structures. The tools of classical probability 
theory should have an important role to play in this line of research. In particular, 
the concept of a commutant should be invaluable as it describes the set of operators 
that are compatible with this subsystem. 

The results of Chap. 5 would not have been possible without noting that the 
fundamental noise process driving the conditional master equation is not simply a 
Wiener process, but is instead the measurement realization itself. With different 
initializations, the conditional master equation produces different innovations and 
only a few of them will have the statistics of a Wiener process. Prom a statistical 
perspective, the conditional master equation is an estimator that allows us to predict 
the outcomes of measurements performed on the system, given an ancilla coupled 
measurement record. Furthermore in classical estimation theory, the concepts of 
robustness and stability play an important roles, as it is a desirable for an estimator 
to be robust to modeling imperfections and incorrect initializations. The stability of 
the quantum filter shows that in most cases the conditional master equation is able to 
correct itself given bad initial information. This supports the case that a conditional 
quantum state should be viewed as a quantum analog to a classical estimator. Using 
this perspective, it is possible that one might be able to formulate a variant of the 
quantum filter, that is more robust to modeling errors or corrupting noise in the 
measurement signal. 

For real optical beams, the continuous-time tensor product decomposition was 
only an approximation, and that for short enough timescales the operators did not 
strictly commute. Outside of this approximation, it is not immediately clear if one 
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could formulate a truly commutative space of operators for the purpose of defining 
a conditional expectation. As the technology of ultrafast lasers progresses, it might 
be possible to experimentally test a regime where subsequent optical measurements 
almost commute and the conditional dynamics might then reveal surprising quantum 
effects. Doing so would likely require formulating a conditional expectation on a 
noncommutative von Neumann algebra, which in some cases is possible [76] , but the 
classical probabilistic interpretation is lost [25] . 

Von Neumann argued that a commuting approximation to almost commuting 
observablcs was always possible [82], however this has been shown to be not the 
case [83]. In the context of spin chains, Ogata recently established the existence 
of commuting approximations for "macroscopic" observations [84]. It is likely that 
these mathematically rigorous results will be invaluable in identifying the consistent 
information embedded in a sequence of noncommuting observations. 



6.3 Projection filtering for qudit ensembles 



The methods of differential geometry are a set of powerful and flexible tools, readily 
applied to a wide variety of problems. Orthogonal projections are fundamental to 
quantum theory. Chap. 4 combines both of these tools to derive an approximation 
to the conditional master equation for an ensemble of n qubits, given a diffusive 
measurement of the collective angular momentum projection J^, and in the presence 
of strong global rotations. The approximation was based on the ansatz that if the 
system was initialized in an identical tensor product state, g — p'^", then it should 
remain close to a state p'®" of some single qubit state p'. The approximation was 
made to find a modified evolution that preserved this symmetry. It was formulated 
by projecting the conditional master equation, acting on the state g, into the tangent 
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space of the manifold of states 

V = {p®" : p is a valid qubit state} . (6.2) 

We worked in a parametrization where the single qubit state is mapped to a vector 
defined within the unit Bloch ball. We were able to derive an analytic expression for 
a projection filter that describes the diffusion of the Bloch vector in a non-Euclidean 
but isotropic space. 

We subsequently tested the quality of the resulting approximation numerically for 
pure spin coherent states. These simulations were performed for a systems composed 
of 25 < n < 100 qubits under a variety of conditions. We make this comparison 
first without an external Hamiltonian, allowing the system to evolve only under 
the action of the measurement. In an exact description this model would produce 
a significant amount of spin squeezing and is confirmed in the simulations. The 
projection filter tracked the mean expectation values with ~ 90% accuracy but failed 
to describe the correlations induced by the squeezing, as it was designed to do. We 
then performed the same analysis in the presence of a Hamiltonian driving strong 
randomized rotations. In this case, spin squeezing failed to significantly accumulate, 
leading to a > 95% agreement between the exact and projected mean expectation 
values and an average fidehty > 80% between the projected and exact states for all 
qubit numbers tested. 

A natural extension of the projection filter is to move beyond qubits and consider 
higher spin systems. Unfortunately, the simplicity of the Bloch sphere is lost for 
d > 2. There certainly exist rf^ — 1 traceless, orthogonal, Hermitian matrices for 
decomposing a ci-dimensional quantum state. The problem is that in attempting to 
formulate a mapping between valid quantum states and a. (f — 1-dimensional ball, 
you find that not every point inside the ball, or its surface, corresponds to a valid 
quantum state [68]. The problem is that while the orthonormal matrices have a 
number of useful features, they do not share the same spectra, and so the boundary 



Chapter 6. Summary and Outlook 



213 



between valid and invalid states is not isotropic. For qubits, we happily ignored any 
issues involving the boundary between valid and invalid states. It is likely that for 
qudits willful ignorance may lead to disaster. The best course of action may be to 
seek a more abstract representation of the state. 

In addition to moving to a higher spin system, we can also consider correlated 
states. One family of correlated states of general interest are spin squeezed states. 
Finding a smooth parametrization for pure spin squeezed states is not difficult as the 
canonical example of spin squeezing is generated by a specific Hamiltonian [70] . By 
composing the one parameter group of squeezers with the group of SU (d) rotations, 
it is likely that one can describe the space of pure spin-d squeezed state as a (P- 
dimensional manifold, baring any issues with linear independence. Whether or not 
there is a wieldy metric induced on this space is a whole other question entirely. 

An additional complication is the unavoidable fact that for a model to be at 
all experimentally useful, it must be able to handle mixed states and decoherence. 
Adding single qubit decoherence to the separable projection filter is a trivial un- 
der taking. Any map that acts identically and independently on each qubit is, by 
definition, in the tangent space of identical separable states. The reason why our 
simulations only considered pure state dynamics is because generating exact simu- 
lations for n ~ 50 qubits is quite challenging when the total angular momentum is 
not a conserved quantity. Chase and Geremia derive a simulation technique that 
required only order parameters for exactly propagating n qubits under symmetry 
preserving local decoherence [79]. By applying this or a similar method, we expect 
to be able to extend our numerical tests to include some decoherence to the model. 

While the algorithmic and "optimal" nature of the projection filter is appealing, 
control and system engineers confronted by nonlinear problems have derived a num- 
ber of suboptimal but highly successful estimation techniques. Some of which are 
a linearized extended Kalman filters [85], "unscented" Kalman filters [86], Monte 
Carlo "particle" filters [87], symmetry preserving filters [88], and so on. Some or 
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all of these techniques may prove useful for partially observed quantum systems. 
Although, without a general mapping between quantum observables and classical 
statistics, none of these tools are applicable. 

6.4 Qubit State Reconstruction 

Chap. 5 applied the quantum filtering formalism to construct a tomographic estimate 
for an unknown initial quantum state from an ensemble of identical copies experi- 
encing a joint continuous measurement. We found a maximum likelihood estimate 
of the initial state, based upon the statistics of a single continuous measurement 
realization. The purpose of this work was to extend previous results using a contin- 
uous measurement for quantum state tomography, into a regime where the quantum 
backaction significantly affect the measurement statistics. In a numerical study with 
ideal conditions, we found that our estimate nearly saturate an optimum bound. 
Derived by Massar and Popescu, this bound states that the average reconstruction 
fidelity given n copies of a pure qubit state and no other prior information, the best 
average fidelity is given by (^)opt = (n -|- l)/(n -|- 2). 

The problem of identifying a tomographic estimate was mapped to a parameter 
estimation problem, where the statistics of the measurement record paramctrically 
depended upon the initial qubit state. We then found that the likelihood function 
for the measurement record was ultimately Gaussian, leading to an equivalence be- 
tween a maximum likelihood estimate and a least-squares estimate. Maximizing the 
likelihood function then ultimately reduced to minimizing the quadratic variation 
of an innovation process, computed from the measurement record and a conditional 
state estimate. When the conditional state corresponds to a "correct" description, 
then the innovation is a Wiener process, setting the minimum value to be ~ t. 

In order to make a numerical implementation computationally feasible, we ap- 
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proximated an exact innovation by one computed with the projection filter. As this 
reconstruction procedure is tied to the quahty of the projection filter, any improve- 
ments in its accuracy will almost surely improve the reconstruction fidelity. We 
expect that by extending the projection filter to include squeezed states, there will 
be near perfect agreement between an innovation computed from the projection filter 
and an innovation computed from the exact conditional master equation. 

The nature extensions of the projection filter carries over to the case of state re- 
construction. The principle of finding a least-squared estimate is system independent 
as long as evolution still described by a diffusive conditional master equation. How- 
ever when moving to qudits, the number of parameters we need to estimate grows 
unfavorably with d. It is likely that in the general case, it will no longer be feasible 
to simply sample from the compact parameter space and select the most likely candi- 
date. Parameter estimation in nonlinear statistical models is a well studied problem 
for classical systems and we beheve that a classical solution will be adaptable to the 
quantum case. One possible avenue to investigate is a statistical importance and 
resampling technique, a "particle filter" [87], which has already been adapted to a 
quantum parameter estimation problem [89]. 

From our perspective, it is an open question as to whether or not minimizing the 
innovation's quadratic variation is the optimum statistical test to use. In hypothesis 
testing, comparing the ratio of two likelihood functions has been shown to have the 
most predictive power out of all statistical tests. Prom that fact, one possible method 
for improving the reconstruction procedure is to compute the likelihood ratio between 
each candidate state and a master equation initialized in the completely mixed state. 
In hypothesis testing, the ratio is made by comparing the likelihood of the data being 
generated from your model compared to a null hypothesis. Por a quantum system 
the most logical null hypothesis is the completely mixed state. It is possible that 
by computing this likelihood ratio we will be able to better discriminate the signal 
arising from the initial state from the signal caused by the quantum backaction. 
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It is still an open question as to why this continuous measurement scheme ap- 
proaches the optimum bound computed by Massar and Popcscu. Because the numer- 
ical results perform so well, it is likely that the randomized controls are mapping the 
continuous measurement to a uniform measure over all spin coherent states, as this 
is known to achieve the optimum bound [74]. Understanding what effective POVM 
a given controUed-continuous measurement implements will likely be a powerful re- 
sult in itself. Armed with that knowledge, a clever experimentahst could engineer 
any number of complicated measurement protocols. Not the least of which being 
unambiguous state discrimination [63]. 
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Appendix A 



Paraxial Optics 



This appendix review the paraxial wave equation and ultimately calculates Fourier 
transform of a paraxial mode function. The paraxial wave equation begins by assum- 
ing a quasi-monochromatic solution to the wave equation that takes the form of a 
rapidly oscillating plane wave, ex.p{+i{koZ — ouot)) modulate by an envelope function 
that changes slowly in both space and time. If there is a vector valued function 
W(x, t) satisfying the wave equation 

V'U{^,t)--^Ui^,t)^0, (A.l) 

then we hypothesize a real- valued solution of the form Vl{x.,t) = tl^~^\x,t) + c.c. 
where 

Z^(+) (x, t) = (x, t)e+i(koz-u>ot) (A_2) 
and u^'^\'x.,t) is a slowly varying function. Slowly varying is characterized by the 
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Based upon this assumption one can then neglect terms in the wave equation that are 
second derivatives with respect to z and t. The result is the paraxial wave equation, 

_V|nW+,(^ + -^j=0. (A.4) 

denotes the Laplacian with respect to the remaining transverse coordinates 
{x,y}, and we will denote the transverse direction as x^. It is often said that 
the paraxial approximation is valid under the assumption that the envelope function 
u^'^^ varies slowly when compared to an optical period, 2Tr/uJo. 

Under the change of variables (x; t) (xt, z; t — z/c) the paraxial wave equation 
becomes independent of retarded time tr — t — z/c. Without a loss of generality it 
can be assumed that 

U^+\^,t)^f(tr)ui+\^T,z)e-'^°'^ (A.5) 

for any slowly varying /. In this case we have, 

-^V^4+HxT,^) = -z^4+HxT,^). (A.6) 

Zkq OZ 

If we make the replacement z ^ t and ko — )• m/h then Eq. (A.4) is identical to 
a two-dimensional Schrodinger equation in free space. Furthermore in the Fourier 
domain, each plane wave component is an eigenstate of the "Hamiltonian" ^'Vt 
ultimately implying, 

Ik I 

4+)(kT, z) = 4+^(kT, 0)e-'^'. (A.7) 

where k^ = kxBx + kyBy. Note that this is Fourier transform is with respect to the 
transverse components only and is still a function of the longitudinal component z. 
Taking the Fourier transform of the full solution W(x, t) — > W(k, t), 

W(+)(k, t) = 4+^(kr, ^ = 0) / -^e-'^^'e-'^'f{t - z/c)e'^'^'-'''\ (A.8) 

J v 27r 
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If we change variable from taking the spatial transform with respect to z to trans- 
forming with respect to the retarded time tr we find that 

U^+\k,t) = cf{c\kT\y{2kQ) + ck,-uJo)u^T\kT,0)e \ ^ V (A.9) 

where /(cu) is the temporal Fourier transform of /. 

We know that regardless of any approximations the positive frequency compo- 
nent of a travehng wave solution evolves according to ZY^'*'^(k, t) — W^'*"^(k, 0)e~*'^l'*l*. 
Evidently, the paraxial approximation is an approximation that 

Ik P 

u{k) = c\k\^cC-^ + h). (A.IO) 

This approximation seems slightly at odds with the fact that |k| is strictly nonneg- 
ative, while the right-hand side of Eq. (A.IO) extends to negative frequencies. This 
issue is resolved by the assumption that u^^\x,t) is a slowly varying function or 
equivalently u.^^\k,t) is a sharply peaked function about k = 0. The resolution is 
that because of the carrier plane wave, U^~^^ (k, t) is a sharply peaked function about 
ko = koez- While in principle uj{k) can be negative, these negative components 
never contribute as long as the paraxial approximation holds. Finally we find that 

U^+\k,t) = c/(a;(k) -a;o) 4^^(kT, 0) e'^'^^'')*. (A.ll) 
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Appendix B 

Classical Stochastic Calculus 



This appendix reviews the derivation of the Stratonovich and Ito integrals as well as 
the rules of Ito calculus. We attempt to describe the salient points found in standard 
texts while leaving out the proofs. 

Stochastic integration, in either the quantum or classical sense begins from a form 
of functional integration. The traditional Riemann integral takes a function f{t) 
and integrates with respect to a small difference in its argument At. In a functional 
integral, as defined by Stieltjes, the function f{t) is integrated with respect to a small 
difference in another function g{t), i.e. 

/ f{s)dg{s) ^ hm J2fit:) (giU) - g{ti-i)) (B.l) 

1=1 

where ti-i <t*<ti. This limit can be shown to make sense if / and g are reasonably 
well behaved. One limitation is that g can't vary "too much" over a time interval At, 
(the total variation of g must be finite) [53]. Like the usual definition of a Riemann 
integral, the convergence of this integral does not depend upon where t* lies in the 
interval [tj-i, tj]. 

A stochastic integral, often called a stochastic differential equation (SDE), re- 
places both functions / and g by stochastic processes. However in this replacement 
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the problems of working with nondifferentiable functions leads to a more delicate 
situation. The fact that Brownian motion has a nowhere smooth trajectory means 
that its total variation is infinite, leading to a divergence in a Riemann-Stieltjes 
limit [22]. Not only does this mean that we are forced to consider a different kind 
of limit, but the choice of t* makes a dramatic difference on its mathematical and 
statistical properties. Specifically, if t* — ti-i, one arrives with an Ito integral, which 
has several desirable statistical properties but does not obey the chain rule as seen 
in ordinary calculus. If, however, t* is taken at the midpoint of the interval, called 
a Stratonovich integral, then the rules of calculus are preserved but the statistical 
properties are more involved. Fortunately there exists a simple conversion between 
the two integral definitions. We will discuss all of these issues in greater detail in the 
following section. 

Concretely, the Ito and Stratonovich integrals begin with the following definitions. 
Consider the partitioning of the time interval [0, r] into in increasingly dense mesh 
n-ordered times {t„ : n e N}, < ti < ■ ■ • < tn — r. The Ito integral takes the 
(time-adapted) process {a;t}^>o and defines the integral of the well-behaved function 
b{xt), with respect to the Wiener process Wf, to be 

/ b{xt) dwt = lim b{xt^_-,) {wt^ - (B.2) 

•^0 i=l 

This limit is then shown to converge to an almost unique object, with probability 
one^ The Stratonovich integral takes a different definition, 

/ b{xt) o dwt = lim V 6 (^^i^) {wt, - wt,-i). (B.3) 

•'O i=l 

These two definitions arrive at fundamentally different, but not unrelated, integrals. 
One integral can be converted to another by using a simple trick, derived in Sec. 
B.1.1. 



^ There is a slight caveat where one could add another random process which happens 
to have zero probability of ever occurring. 
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Before moving on to discussing the operational and statistical properties of these 
integrals it is worth noting, that in attempting to model a classical physical sys- 
tem, with SDEs the choice of calculus is crucial. Fortunately, the question as to 
which calculus to use is answered by the Wong-Zakai theorem [90] (see the intro- 
duction to Appendix D). In this paper they showed that an ordinary differential 
equation containing a piecewise smooth approximation to Brownian motion, limits 
to a Stratonovich equation and not an Ito equation. If one derives an equation of mo- 
tion for a system including an approximation to Brownian motion then the solution 
to that equation must be interpreted in the Stratonovich sense. 



The rules of Ito calculus can be derived with varying levels of detail and sophistica- 
tion. Their practical purpose is to give a method for manipulating and combining 
multiple Ito integrals into new and different expressions. The bottom line result is 
that the standard differential chain rule, d{fg) — f'dg -\- g'df must be extended to 
include a second order correction, see Eqs. (B.14 - B.15). An often cited reference 
for the derivation of the Ito integral and Ito calculus is the book by Oksendal [22] . 
There he shows how the limits in Eqs. (B.2) and (B.3) may or may not converge. 
Specifically, the standard techniques for defining a integral with respect to a Rie- 
mann sum fails, because in doings you ultimately consider the quantity, called the 
total variation. 



for the partition of times a < < ■ ■ ■ < tn — h. What you can show is that with 
probability one, this is infinite for the Wiener process. However you can also show 
that instead of summing the absolute value of each increment, \wt^—Wt^-\\■| you sum 
the square of each increment, (wj. — then you obtain a finite quantity. This 



B.l Ito Calculus 



n 




(B.4) 



i=l 
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is called the quadratic variation, and with probability one, 

n 

lim y^{wt^ - wt^-l)'^ ^b - a. (B.5) 

i=\ 

What this is saying is that while the Wiener process travels an infinite absolute 
distance in any finite time, it RMS displacement only grows like the square root of 
time. Using the fact that the Wiener process has a well behaved quadratic variation, 
the limits of the Ito and Stratonovich integrals make sense if you consider their 
squared expectation value, the so called >C^(P) limit. 

For a fiavor for how a squared expectation value might make sense for a Wiener 
process, observe that as it is constructed to have Gaussian statistics for any finite 
interval, 

jim ^[{Y.^ K - ^U-r ) j j = E (( - )2) = t _ 5. (B.6) 

The way this is turned into an integral is that if one has the time- adapted stochastic 
process bt, i.e. it is assumed to be independent of Wf for times t' > t, then it can be 
shown that 

limE((f]6,^_,K-^,,_j)M = limE(f]6t,(t^-t.-i)) (B.7) 



i=l / \i=l 



as long as E ^ b^ dt^ < oo [22] . A fair amount of analysis goes into showing for 
what kinds of processes a piecewise constant approximation. More work is needed 
to extend the proof to hold with probability one. We mention this here only because 
it indicates the fine of reasoning that relates the product of two Ito integrals to a 

integral over time of the product of the integrands. 

Moving from a consistent definition of an integral to a mature calculus involves 
placing a constraint on what kind of integrands we are able to use in a stochastic 
integral. Consider the example of the recursively defined Ito process 

Xt — Xo+ / a{s,Xs)ds+ / b{s,Xs)dWs, (B-8) 
Jo Jo 
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where a and b are continuous functions that are once differentiable in time and twice 
in X. An extremely common and useful notational device is to write an Ito integral 
in a differential form, 

dxt = a{t,Xt) dt + b{s,Xt) dwt (B-9) 

to represent that integral. The utility of this notation is apparent in that if Xt were 
an ordinary deterministic equation, (setting b{t, Xt) = 0) then Xt is the solution to 
the equation 

-^ = a{t,x). (B.IO) 
This is why that these kinds of equations are called stochastic differential equations. 

A typical application is to consider two such equations, so in addition to Xt we 
have the differential 

dyt^ c{t,yt)dt + d{t,yt)dwt. (B-H) 

We wish to find a differential for the product Xtyt or even some other function 
f{xt,yt)- The answer to the first example is that 

d{xt yt) = ait, Xt) yt dt + b{t, xt) yt dwt 

+ xtc{t,yt)dt + xtd{t,yt)dwt (B-12) 
+ b{t,xt) d{t,yt) dt. 

This expression can be derived by computing the Z)^(P), Atj — )> limit of the def- 
inition of the Ito integral. It is important to emphasize that the right-hand side of 
this equation is itself another Ito integral. By multiplying, adding, and subtracting 
Ito integrals one still finds "just" another Ito integral, with the whole closing upon 
itself to form an algebra. Compare this example to a second order Taylor expansion 
of the product Xtyt, 

d{xt Vt) = dxt yt + Xt dyt + dxt dyt. (B. 13) 
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If we apply the Ito rules to the second order product dxtdyt, the only surviving 
term is (bdwt) {ddwt) = bddt. This means that there is a consistency between Eqs. 
(B.12) and (B.13). The general case for some function f{xt, Ut) is given by the second 
order Taylor expansion, 

1 d'^fjxt, yt) ^ , Id^fixt^yt), , , d'^f(xt,yt) ^ , 

+ 2 dxj + 2 dy^ + -^^^ ^^-^'^ 

where 

dxt dxt — y^^t, Xt) dt, (B.15) 

dytdyt = d'^{t,yt) dt, and (B.16) 

dxt dyt = b{t, Xt) d{t, Xt) dt. (B.17) 

In order for this general Ito expansion to be well defined, f{x,y) must have a finite 
first and second derivatives. 

B.1.1 The Ito conversion 

Supposed we have the recursive Itb form SDE such that 

Xt- — a{xt) dt + / h{xt) dwt- (B.18) 
Jo Jo 

We then assert that this same process has a corresponding Stratonovich form, 

Xt— j a{xt)dt+ / h{xt) o dwf. (B.19) 
Jq Jo 

(a has no relation to the Fourier transform.) Our goal is to then find a relation 
between the functions a{x) and d{x) that makes this assertion true. Subtracting 
these two expressions lead to the equality. 



rT pT pT pT 

/ b(xt) o dwf — I b{xt) dwf — I a{xt) dt — d{xt) dt. (B.20) 
Jo Jo Jo Jo 
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This difference is known as tfie Ito correction term. 

To ligfiten tlie notation we will define the intervals, Axj = Xj. — Xt^_^, Awi 
Wt^ — Wt^_^ and Ati = — The Ito correction can then be written as 



i=l 
n 

lim ih ixt,_^ + \ Axi) - h{xi-i)) Awi 

i=l 



(B.21) 



We can write the integrand very suggestively in terms of a prelimit form of the 
derivative of b{x). By defining 

£(^) ^ ^(^±M^ (B.22) 

we then have 

b {xt,_, + I Axi) - b{xi_i) = ^ ^(^ti-i) ^^i- (B-23) 

With a recursive definition for Xt, we can substitute Axi to find 
" 1 Ab 

= 2 A^^"'*'-^^ + Ka^i^-J Aw,) Aw,. (B.24) 

i=l 

However, by the rules of Ito calculus (i.e. AtiAwi — )■ and AwiAwi — )■ Atj with 
probability 1) the whole expression converges to 

/c= (B.25) 

And so we ultimately find that 

a{x) = a{x) b{x). (B.26) 

This conversion between Itb and Stratonovich equations is vitally important in Chap. 
4, where the first order rules of ordinary calculus allows for the application of differ- 
ential geometry to a stochastic system. Sec. 4.2.4 gives an example for why using 
Stratonovich calculus is necessary. 
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Appendix C 



Quantum Stochastic Calculus 



This appendix reviews the basis notation and properties of an Ito form quantum 
stochastic differential equation (QSDE). Sec. 2.5 discusses at length how a bosonic 
Fock space defined over the single particle Hilbert space f) = >C^(R+) (g) C^, 

has the operators Q] and P/. Each of these operators have the statistics of Wiener 
processes when taken in vacuum expectation. They are constructed though linear 
combinations of the annihilation and creation operators, Al = a[X[o^t] ^i] and A]"^ = 
tf^[^[o,t] Gj], which are also used to form non-Hermitian quantum Ito integrals. In 
addition to these processes, Sec. 2.6.2 encountered a different kind of operator, the 
scatter or conservation processes A]^ . Here we will gloss over how A]^ can be defined 
in terms of an operation acting on the single particle Hilbert space (see [45]). Note 
that [A^y = Af . To define an integral with respect to A^, A\\ and A\\ it is 
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sufficient to know tlie matrix elements^, 

{c[t]\Al |e[h]) = fds h,{s) (c[f]|c[g]) , (C.la) 

(^e[f]\Al^ \c[h]) = iysf;is) {e[f]\e[g]), and (C.lb) 

(e[f]| |e[h]> = fds ftis) h,{s) (e[f]|e[g]) . (C.lc) 

Tfie derivation of tfie quantum Ito integral and tfie equivalent stochastic calculus 
begins in much of the say way as the classical Ito integral. Eq. (B.2) gives the limiting 
sequence used in defining the Ito integral, with its characteristic nonanticipative 
integrand. The quantum Ito integral is given by a similar form, where the integral 
with respect to dAl is given by the limiting sum, (with the mesh of times — Q < 
ti < ...tn<t) 

X, dAi ^ Jim J2 ^U-. (4 - 4-. ) ■ (C-2) 

Similar definitions are made for integrals with respect to Ai\ , and time. The 
integrand Xg is required to be a time-adapted process meaning that it is required 
to act as the identity on the Fock space ^(l)(s,oo))- Note that in general, Xg is not 
required to act solely on the Fock space =^(f)[o,s])- It could be an operator defined over 
a joint space Hsys ® =^(f)[o,s])- Because the integrand is required to be time-adapted 
it commutes with the differential and so we can write, 

Yt= ! Xs dAi = / dAi X,. (C.3) 
Jo Jo 

We will use both forms, whichever is more convenient. It should not be that surpris- 
ing that proving that the above integrals exists and are finite is more difficult than 
in a classical setting. We will not reproduce the full result here, see [23, 52] for the 
proof. 



^For technical reasons the amplitudes of these states are assumed to be square integrable 
and have a large but finite upper bound [25, 45]. 
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However to get a sense of where the quantum Ito rule comes from, we will discuss 
a few key points. The proof of convergence of the infinitesimal limit is shown by 
taking a piecewise constant approximation, and then proving convergence of the 
matrix elements. The set of vectors chosen for those matrix elements are the tensor 
product of any system pure state and an exponential vector. By showing that it 
hold for these matrix elements, you can then extend the result to hold in expectation 
with any state composed of convex hnear combinations of these vectors. For example, 
consider the integral Yt in Eq. (C.2), 



n— >-oo 

i=l •'li-i 



(^®e[f]|Ft|^®e[h]) = lim ^ / ' ds hi{s) ® e[f] | X^,. ® e[h]> . (C.4) 



For a reasonably large class of integrands the piecewise constant approximation is 
appropriate and converges to 



{jjj^e[f]\Yt\iP®e[h]) = I dshiis) (^/^ ® e[f] | X, |^/^ ® e[h]) . (C.5) 





Equivalent expressions occur for integrals with respect to dA^^ and dAl^ , where hi{s) 
is replaced by the correct amplitudes as given in Eq. (C.l). 

In order to define a proper calculus, one must also consider how products of the 
integrals behave. The elementary step is to consider the matrix elements of two 
operator combinations of A^, A^^ and A]^ . The matrix elements (e[f]| AlA^ |e[h]). 



e[f] Af Ai^ c[h]y, and (e[f]| A]^ A^ |e[h]) are easily to calculated, as they involve 
an eigenvalue relationship of At and the matrix elements given in Eq. (C.l). The 
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nonobvious two operator matrix elements are [52, Proposition 20.13] 



c[f] AlAi^ 



e[hl 



ds f. 



Jo 



ds hi{s) 



+ 5,, / ds (e[f]|e[g]), 



(C.6a) 



e[f] 



AIM' 



e[h] 



e[f] 



ds f*{s) hk{s) / ds hi{s) 
Jo 

+ ds hk{s)^ (e[f]|e[g]), 

f ds f*{s)h,{s) f dsfl{s) 
Jo Jo 

+ (e[f]|e[g]), and 

(e[f] I M' Af |e[h]> = {^f^ ds ftis) hj(s) ds ms)he(s) 

+ 5^,j\sf;{s)h,{s)^ (e[f]|e[g]). 



(C.6b) 



(C.6c) 



(C.6d) 



Calculating these expressions requires knowledge of the commutation relations be- 
tween all of the noises and/or their relation to the exponential vectors. Without 
discussing the origin of A^, writing down the commutation relations are no more 
intuitive and less useful than above matrix elements. Inspecting these four equations 
shows that each has two parts. The first is essentially the product of the single oper- 
ator matrix elements, up to a factor of (e[f]|e[g]). The second part is an additional 
term due to the noncommuting structure of the processes. If we express these four 
matrix elements in terms of differentials, 

'e[f] I d^t dAp |e[h]J) = (0{dt'') + Sij dt) (e[f]|e[g]) , 

eifl' 



dAl dAl 



e[f] 



jk 

dK'dA^^ 



e[h]) = (0(dt2) + 5,.dt/ife(t)) (e[f]|e[g]), 
e[h]) = {0{de) + 5^kdtf*{t)) (e[f]|e[g]), and 



{e[i]\dl^i dhf |e[h]> = (0{de) + 5,kdtrdt)hi{t)) (e[f]|e[g]) . 



(C.7a) 
(C.7b) 
(C.7c) 
(C.7d) 



Notice that each term on the order of dt is expressible in terms of the matrix element 
of a differential, dt, dA^, etc. Taking the terms 0{dt'^) 0, and asserting that 
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d^dAi^ 


= 5ij dt, 


dAi rfAf 


= 6ij dAt, 


dAl' dylf 


= SjkdAl^ 


dAi' dkf 


= Sjk dAf. 



knowing these matrix elements is sufficient, we fiave tlie quantum Ito rules 

(C.8a) 
(C.8b) 
(C.8c) 
(C.Sd) 

All other differential products are zero. 

The remainder of the construction of the quantum Ito calculus is technical, and 
involves showing how the product of two piecewise approximations converge as Ati — >■ 
0, as well as dealing with the equality 

{Xt Tjj ® e[f] \Yt jjj (g) e[h]) e[f] XjYt t/j ® e[h]\ (C.9) 



for two integrals Xf and Yt. These issues arc well beyond our scope, except that we 
will note that when X^ and Y^ are bounded in a suitable sense, all things work out 
nicely [25, 45]. 

Before moving on to the specifics of a QSDE for a unitary propagator, wc will give 
the following general example for using the quantum Itb rule. (See [45, proposition 
2.4] for all of the necessary qualifiers and assumptions.) Consider the quantum 
stochastic process Xt, given by the Ito integral (implied sums on repeated indices), 

Xt^Xo+ [ dA'jFi^+ [ dA'jFf+ [ dAlF^^+ [ dsF°°. (C.IO) 
Jo Jo Jo Jo 

The processes F^^ are assumed to act nontrivially only on the joint Hilbert space 
'Hsys ® ^([)[o,s]) and are integrable (without really defining what that means). The 
initial value Xq is assumed to be a bounded operator that acts as the identity on 
^(f)). This integral can also be notated differentially as the QSDE 

dXt = F^' dAi' + Ff dAj^ + F^' dAi + F°° dt. (C.ll) 

Given another process Yt whose differential is 

dYt - Ki' dAi' + dAi^ + K^' dAi + dt, (C. 12) 
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the product XtYt can also be expressed in terms of a process Zt = XtYt whose 
differential is given by 

dZt = Mi' dAf + dAi ^ + M°^' dAi + M°° dt, (C. 13) 

where the resulting integrands M^^ are 

Mi^ = Xt Ki^ + Fi^ Yt + , (C . 14a) 

Mf = Xt Kf + Yt + i^'-' A7° , (C. 14b) 

M°^' = + F°^' Ft + Ki' , and (C. 14c) 

= Xt + F°° Yt + F^' Kf. (C.14d) 

This result can be viewed as an application of the quantum Ito product rule 

d{XtYt) = Xt dYt + dXt Yt + dXt dYt, (C.15) 

where dXf dYt is multiplied out and the second order differentials are evaluated ac- 
cording to the quantum Ito rules in Eq. (C.8). 



C.l The Quantum Stochastic Unitary 

With the general quantum Ito rule firmly in hand, we would like to apply it to 
find a universal expression for a unitary process Ut- This is actually quite straight 
forward by first noting that because its unitary, UtUt = 1. Wc also know that if 
Ut is independent from the fundamental processes At, AI and A^, than Ut is the 
solution to the ordinary differential equation dUt — —iHt Ut dt. When including 
the fundamental processes, the objective is to write Ut as a, general QSDE and find 
how unitary constrains the various integrands. Taking the "noise free" solution as a 
starting point we hypothesize the coefficients Gf'^ so that Ut is given by the QSDE 

dUt = Gf Ut dAf + Gf Ut dA^t^ + Ut dA^ + Gf Ut dt (C. 16) 
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and its adjoint is 

dU} = U} Gf dM' + Ul Gf dAl + dAi ^ + U} Gl^^ dt. (C. 17) 

The unitary constraint's impact on the differential is that d{Ul Ut) = d{Ut U}) = 0. 
The general Ito product coefficients in Eq. (C.14) then says that in order for this to 
unitary, 

U} Gf Ut + ul Gf " ^ Ut + ul G\^ ^ Gf f/t = 

(C.18) 

Ut ul Gf"^ + Gf Ut ul + Gf Ut ul Gf ^ = 

for a, (3 starting at zero and the implied sum over i starting from 1. Eliminating Ut 
and ul from the constraints, they simplify to 

Gf + Gf^ + Gf^ Gf = Gf + Gf^ + GfGf^ = 0. (C.19) 

The coefficients G""^ are typically written in terms of a different set of operators, 
Sl\ L\ and Ht. The reason for this transformation is that {Sl\L\,Ht) have more 
desirable and physically relevant properties than G"^ . Immediately we can see that 
some part of should be —iHf, as the general QSDE solution contains the case 
where U {t) — ex'p[—iHt) for a time independent Hamiltonian H. Also if G'^' = G*° = 
then Eq.(C.19) reads as G°° = -G°°t, implying that G?° = -iHt for Hermitian 

To identify how S'^-' fits into the picture, consider for the moment the case where 
each G]^ — gijl for some complex coefficients Qij. Then the constraints for G]^ are 

9ij + 9ji + 9*ii gtj = Qij + g*ji + guglj = o (c.20) 

Writing the constants in term of a matrix G we have 



G + + G = 0. 



(C.21) 
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If we define a matrix S = G + 1 then this constraint reads, 
= S-1 + S^-1 + (S^- - 1) 

= S-l + S^-l + (StS-St-S + l) (C.22) 

1 = S^S. 

In other words S is a unitary matrix. Returning to the general case, we can still 
define the operators 

S^ = G^ + 5ij. (C.23) 
Then Eq. (C.19) transforms the constraint for Gl^ into the constraint, 

Sr^St = Si^S^^^ = S,k. (C.24) 
In other words. Si'' is a unitary matrix of operators. 

By introducing the constraint for G^* is also significantly simpler. Specifically, 

= 6-°^ + Gf ^ + Gf ^ (5f - Su) (C.25) 

The remaining coefficients Gf are essentially arbitrary, which is relabeled as the 

operators L^. Writing the constraint for in terms of L\ means that + = 



■a/3 



Bringing all of these results together we can reexpress in terms of {Si'' , LJ, Ht) , 



G? = LI, 

(C.26) 

Gf = -iH, - \V^L\. 
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In other words, 

dUt = ( {si' - Sij) d^i + Lj dAj^ - Lf 5f dJ^t - \V^L\ dt - iHt dt ) Uf (C.27) 

This is the standard form for the QSDE for a general propagator Uf. While in 
principle, the initial value for Uq could be any unitary operator acting on a system 
T-Lsys, typically Ut describes an interaction picture representation of the system-field 

dynamic, and then Uq = 1- 

One final remark is that if we take each coefficient to be its own stochastic process, 
{Gf^Ut}t>o, then they are required to still be time-adapted. This results in the 
constraint that the initial values Sq , Lq and Hq must all be system operators only, 
as they must act as the identity on the field at that time. Furthermore if 5'^-', L] and 
Ht are known to be time independent, then they must be system operators only. 



C.1.1 Unitary evolution 

One final calculation we will include is the unitary evolution of a time independent 
system operator X. In the quantum stochastic literature this unitary evolution is 
written in terms of a map jt{-) generating the "flow" or current of the operator, 

jt{X) = UjXUf (C.28) 

This map can a written as a solution to a QSDE, 

jt{X) = ulxUo + f djs{X), (C.29) 
Jo 

with a differential 

djt{X) = dU^ XUt + UlX dUt + dUlXdUf (C.30) 
After an exercise in quantum stochastic calculus, one finds the recursive QSDE 

djtiX) = 3t{CnX)) dAi^+MCf{X)) dAl^+MC'.'iX)) dAl+j,{Cf\X)) dt (C.31) 
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where C'^^{-) are known as the Evens-Hudson maps and in terms of G"^ are 

Cf{X) = Gf'^^X + XGf + G^t'^'^XGf. (C.32) 

When written in terms of {Sl\ L\, Ht) these maps are, 

Ci {X) = S^' ^XS^' - 5ijX, (C .33a) 

Cf{X)^S'^'^[X,Ll], (C.33b) 

£°^■(X) = -[X,Lf]^f^ (C.33C) 

= +i[H„ X] + Li^XLi - \Vt^L\X - IXLi^Li (C.33d) 

For an arbitrary model the unitary cvohition becomes exceedingly complicated very 
rapidly. Each coefficient in Eq. (C.31) is itself given by the unitary flow of the 
operator Zt = Cf^{X). Systems lacking any kind of fundamental symmetry will 
rarely close on a useful subspace of operators meaning that after repeated applications 
of C''^^{-) more complicated operators will be generated, spanning a larger and larger 
space of operators. 

In addition to calculating the unitary output of system operators, it is also useful 
to calculate the output for the fundamental field operators A{\ A^, . In a rather 
tedious exercise in manipulating the quantum Ito rules it can be shown that 

dj,{j^,) =jt{St) dAl + MU) dt, (C.34a) 

djt{Al^) =jt{Sl'^)dA':^+j,{Ll^)dt, (C.34b) 

dUAf) ^jt(Sl'^Si') dAf + 3t{Sf^Ll) dA\^ + m^Sl') dAf + j,{V,^Li) dt. 

(C.34c) 
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Appendix D 

The Quantum Wong-Zakai 
Theorem 



In a classical system, the convergence of an ordinary differential equation to a stochas- 
tic one was treated in the work of Wong and Zakai [90] . There they show that an 

ODE containing a picccwisc-smooth approximation to white noise, converges to 
a Stratonovich integral as ^^'^^ with A ^ 0. For instance. Consider the ODE 

^^^'^^'^ = f{t,x^'\t))+git,x^'\t))^i'\ (D.l) 



dt 

The Wong-Zakai theorem states that the integrated solution 

xW(i)=xW(0)+ f dsf(s,x^^\t))+ f dsg(s,x^^\s))^i^^ (D.2) 
Jo Jo 

converges to the Stratonovich integral 

Xt = Xo+ / f{x{t),s)ds+ / g{x{s),s) o dwg. (D.3) 
Jo Jo 



Appendix B reviews the distinctions between the two most common forms of 
classical stochastic integration, the Ito integral and the Stratonovich integral and 
Appendix C discusses their quantum analogs. This appendix reviews the quantum 
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analog of this result where the specific ODE is for a propagator whose Hamiltonian 
contains field operators that are limiting to quantum white noise. 

In 2006, Gough derived a quantum hmit, equivalent to the Wong-Zakai theorem 
[43]. Specifically, he investigated the convergence of the Schrodinger equation, writ- 
ten in terms of the time evolution operator U{t), as field operators in the Hamiltonian 
converge to singular, delta-commuting operators. The specific Hamiltonian consider 
is given in Eq. (D.9), but before discussing it, we will first describe the quantum 
version of ^^^^ and how it can be interpreted as quantum white noise. 



D.l Quantum white noise 

In order to make a connection with a quantum Ito integral as formulated by Hudson 
and Parthasarthy, the limiting field operators clearly must be reference to a Fock 
space ^{i}'), f)' = /^^(M"*") (8) C*^. The delta commuting hmit is introduced by con- 
sidering the dijjerentiable functions {^i'^\t) G [)' : i = 1, . . . ,d} parameterized by 
A > so that we have the field operators oiCl^H^)] and d^[^\'^\t')], with 



a[^l'\t)], at[ef )(t')]J = {^i'\t), (t')) ^ %(A,t - t'). (D.4) 

This inner product is assumed to satisfy the properties: 

/oo 

dt Cij{X,t) < oo, (D.5a) 
-oo 

Cij(X, t) = 4(A, -t), and (D.5b) 

limCjj(A, t) = SijS{t). (D.5c) 



A-)-0 



To simplify the notation, it is convenient to write di{X,t) = ci[^i^\t)]. 

These operators end up serving two purposes in the quantum Wong-Zakai theo- 
rem. The first is of course to act in the limiting Hamiltonian and the second is to 
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generate the smoothed exponential vectors, 



e[g(A)] = exp 




oo 



dt g^{t)al{\,t) |0). 



(D.6) 



Note that in relation to the one-dimensional representation of Sec. 2.4, di{\,t) 
is almost equivalent to d[ip'^'^\t)]. The differences lies in how the smoothed wave 
packets arc defined. One possible mapping between paraxial optics and the abstract 
operators ai{X,t) is to identify d paraxial spatial mode functions uI~^\xt, z), which 
satisfy the orthogonality relation, J d'^XTU*{x.T, z) ■ Uj{x.T,z) — Sijax- For each 
paraxial mode there are d independent complex wave packet envelopes, inducing the 
smoothing operators d[(pl'^\t)]. In the case of a single paraxial mode, Eq. (2.107) 
gave the expression for a smoothed paraxial wavcpackct g'^°")(k, t). In order for this 
expression to be equivalent to the argument of Eq. (D.6), we require that 



The fact that we require the time reversed version of (f l is an artifact of defining 
the smoothing with respect to a convolution. The inclusion of the carrier phase is 
both mathematically and physically interesting. Physically it is a reminder that the 
elements of f)' represent the part of the light existing on the measurement timescale, 
which is much slower than the carrier frequency. In fact the appearance of this 
phase is intimately related to the rotating wave approximation. In the rotating 
frame atomic transition operators develop explicit time dependence at the carrier 
frequency. The mathematical relevance of this carrier phase is that it cancels any 
rapidly oscillating phases in the inner product, (^(pf'\—t), (pf^\—t')'^. This cance- 
lation is explicitly apparent by computing that 



a,(A,t)-e+-°*a[<^fV^)]- 



(D.7) 




(D.8) 
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where in the final fine we inserted the unequal time inner product in Eq. (2.92), as 
well as remember that for the smoother ip["\ ||g|| — )■ 1. A is simply a parameter 
representing the formal limit that as A ^ 0, cr ^ and {auJo)~^ 0. 



D.2 The quantum Wong-Zakai theorem 

The Hamiltonian that Gough ultimately considers is, 

(d d d 

J2 E^j al(A, t) aj{X, t) + J2 ^io al(A, t) + ^ Eoj aj{\, t) + Eoo 
i,j=l 1=1 j=l 

(D.9) 

The quantum Wong-Zakai theorem then takes the solution to the equation 

fu{X,t) = -'-Hi^t{\t)U{\t), C/(A,0) = 1 (D.IO) 

and proves that the limit lim;^_^o ^) = C^t is a quantum stochastic unitary pro- 
cess, solving a quantum Stratonovich differential equation. Fortunately, a quantum 
Stratonovich integral is also expressible in terms of a quantum Ito integral. Gough 
provides a well constructed conversion between both forms, which we review shortly. 
The specifics of this limit is that it is shown to hold "weakly" , in that 

hm (g) e[f (A)]| C/(A, t) 10 ® e[g(A)]) = (V' e[f]| Ut \4> (g) e[g]) , (D.ll) 

for any system state vectors -0 and (j) and the exponential vectors e[f (A)] and e[g(A)] 
as defined in Eq. (D.6). In addition to the matrix elements of the propagator, it is 
also shown to hold weakly for the Heisenberg picture evolution of an operator X, so 
that 

\im UH\t)XU{\t) = UlxUf (D.12) 



Before describing the resulting unitary process, it is worth stressing several advan- 
tages of the quantum Wong-Zakai theorem. So long as i7j„t(A, t) and the A — > limit 
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are physically justifiable, then the specifics of the total state ptot are almost irreverent. 
The only constraint is that the total state must be expressible in terms of convergent 
sequence of the matrix elements lim;^_^o (V'i <^ ^[fil'^)]! Ptot \4>j e[gj(A)]). This means 
that we allowed the possibility of nonclassical superpositions. A second advantage 
is that the presence of the scattering interaction Eij allows for a much broader class 
of interactions than the linear interactions typically considered in quantum optics. 
While this dissertation will ultimately be considering a linear Hamiltonian {Eij will 
be negligibly small) the fact that the theory could consider a system coupled to an in- 
stantaneous number operator a'^(A, t) a* (A, t) in a "white noise" limit is no small feat. 
One possible example is to engineering a quantum-optical router, where for some set 
of modes the operators Eij coherently scatter quanta in a system dependent way. 
Finally, much can be said for the fact that the limiting interaction is still described 
by a unitary operation. While we have constructed the limiting field operators in 
terms of a measurement timescale, we have not specified what kind of measurement 
we will be performing. Formulating a conditional estimate for the system given a 
measurement of the field is one of the main purposes of Chap. 3 but at the level of 
the system field interaction, everything remains fully coherent. 

D.2.1 Quantum stochastic calculus and operator ordering 

In proving the quantum Wong-Zakai theorem, Gough also found an intuitive corre- 
spondence between operator orderings and quantum stochastic differential equations. 
In order to describe the limiting propagator Ut as a solution to a standard Ito form 
QSDE we need to review this correspondence. Ultimately, the correspondence is 
between quantum Stratonovich equations and time- ordered solutions to a given re- 
cursive differential equation. Conversely, a quantum Ito equation is identified with 
a normally- ordered solution [43]. 

The solution to the Schrodinger equation with a time-dependent Hamiltonian is 
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given by a time-ordered exponential, 

U(X,t)^fexp(^-'-J^ dsHir^t(X,s)y (D.13) 
The time-ordered exponential is a compact short hand for the iterated integrals, 

Texp ^-^ J dsHirit{\s)^ = 

J dtn...J dti HintiX,tn)...HintiX,ti). (D.14) 

Note that Hinti^, t) need not commute with Hint{\, s) and thus the operator ordering 
is critical in this expression. There is nothing in the A — )> limit that changes the 
operator ordering and so identifying a Stratonovich equation with a time-ordered 
equation is simply a statement of this fact. The heart of the quantum Wong-Zakai 
theorem is relating this time-ordered exponential to a quantum Ito integral. As the 
Wong-Zakai theorem considers the limit of matrix elements between two exponen- 
tial vectors, this relation is made by comparing the matrix elements between this 
expression and the matrix element of an iterated Ito integral. 

Sec. B discusses the classical Ito integral and shows how an integral Xt — /J bg dwg 
for a time-adapted process ht leads to the Ito rules of calculus and how it crucially de- 
pend upon the statistical independence of ht from dwt- Not surprisingly the quantum 
Ito integral also relies on a similar independence of the integrand from the differen- 
tial. As Sec. 2.5.1 discussed, the continuous-time tensor product decomposition 
allows for defining field operators d[x{t,t+dt\^j\ — ^t+dt ~ ^t' which commute with 
any operator adapted to the the time interval [0,t]. The quantum Ito integral with 
respect to dAt is defined as 

X, dAi ^ Jim J2 ^U-. (4 - 4- J (D-15) 

and likewise an integral with respect to dA'l'^ is 
f-t " 

Z,^ Ys dA'J ^ Jim J2 Yu-. (4 - 4- J (D-16) 

Jo .^^ 
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Taking the matrix element of Yt between two exponential vectors results in 



e[f] 



e\h] 



ds (e[f]|X,|e[h]) hj{s) 



(D.17) 



and equivalently 

{e[{]\Zt\e[h]) = f dsf*{s) (e[f]|y,|e[h]). (D.18) 
Jo 

Eq. (D.18) is valid because Yg is time-adapted and therefore commutes with dA^J^ . 
This commutating property carries over to the iterated integral, so by substituting 
in for Yg, 

(e[f]|Z,|e[h])= f ds r ds' flis') {e[i]\Xg,\e[h]) hj{s). (D.19) 
Jo Jo 

If we hypothesize the existence of an operator dj{t) by the eigenvalue relationship 



^At)m) = m |e[f]) 



(D.20) 



we then have 



(e[f] I Zt |e[h]) ^ J^dsJ^ ds' (^e[{] | dl{s') X^, %(s) |e[h]^ . 



(D.21) 



This relation holds for any iterated Ito integral as long as the integrand is expressed in 
normal order, with all of the creation operators on the left and all of the annihilation 
operators on the right. But as we have 

POO 

lim a,(A,i)|e[f]) = lim / ds cjk{X,t - s) Ms) |e[f]) = /,W|e[f]), (D.22) 

we will ultimately find an equivalence between the operator dj{t) and the limiting 
form of dj{X, t). 

The proof of the quantum Wong-Zakai theorem follows the procedure of con- 
verting the time-ordered exponential into normal order, showing that the matrix 
elements converge to a finite quantity and then proving a correspondence with an 
equivalent Ito form QSDE. 
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D.2.2 Gauge freedom in the Ito correction 

The difference between a Stratonovich equation and an Ito equation is often called 
the Ito correction term. As we have identified an Ito equation with the normally order 
version of the iterated integral, the Ito correction term is intimately related to this 
conversion. Converting any product of field operators into normal order, is given by 
Wick's theorem [91]. It states that any product of creation and annihilation operators 
can be written as the sum over the normal ordering of all possible contractions 
between all pairs of operators. A contraction between the operators a and b is 
defined as 

a'b' = ab- : ab : (D.23) 

where : ab : is the normal ordering of the two operators. By Wick's theorem we can 
write the product 

dbc ^ -.dbc: + : d'b'c : + : d'bc' : + : db'c' : . (D.24) 

For the boson operator considered here, the only nonzero contraction is 

a'(A,t)a]*(A,s) = [di{X,t), d]{X,s)] = Cij{X,t- s). (D.25) 

The heart of finding the equivalent Ito QSDE from the time-ordered exponential 
is to first apply Wick's theorem to each term in the time-ordered exponential, then 
take the A — )■ limit, and finally re-sum the series. We will not be reproducing this 
result here, where the details of such a limit can be found in the following references. 
In the absence of the scattering terms the proof is detailed in the book by Accardi 
et al. [55] . The scattering terms were subsequently added by Gough [92] . However, 
one important aspect of the limit must be discussed as it affects the final limiting 
QSDE, as well as takes its root in the physical origin of Cij{X, t — s). 

In each term of the time-ordered exponential, the operators on the right are always 
constrained to be at an earlier time than the operators on the left. Therefore when 
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applying Wick's theorem, the contraction a* (A, t)aj'{\, s) will always be constrained 
to have t > s. This constraint means that when A — )■ 0, only half of the Cij{X, t — s) — )■ 
S{t — s) limit will apply. It is often the case that when a causal constraint is applied 
to a delta function limit, an additional complex term appears involving a Cauchy 
principle value. For each Cij{X, r) the extra complex term is called a gauge freedom 
and generates, among other things, an effective level shift in the £'00 term. 

As a concrete example, consider the second order term in the time-ordered ex- 
pansion 

(^) I "^^l ^^^-*(^'^)^-*(^'^)- (^-26) 
The operator product Hint{X,t)Hint{X, s) contains 16 terms with at most 4 field 
operators (from the scattering terms) and in the case of EQo{t) Eqq^s), no operators. 
One part of this expression is the integral 

-Yl j J dsEoiai{X,t)Ejoa]{X,s). (D.27) 

ij 

Applying Wick's theorem means that 

/ dt dsEoidi{X,t) Ejod](X,s) ^ 
■ ■ Jo Jo 

~ / dtj^ds Eoi Ejo (: di{X, t)d]{X, s) : + Cij{X, t - s)) . (D.28) 



This commutator term on the right-hand side will ultimately contribute to the Ito 
correction term and generate the gauge shift, as long as it survives the A — > hmit. 

Therefore the most basic contribution is made by the limit 

lim^ dt dsCij{X,t - s). (D.29) 
By substituting Eq. (D.8) for Cij{X,t — s) and dropping the term proportional to 

1/^0, 

lim dt ds Cij{X,t - s) ^ 5ij lim dt ds cp^"^ ^ ip^"^ {t - s) . (D.30) 
-^-^oJo Jo "-^^Jo Jo 
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In Sec. 2.3.2 we used the example of (p^'^'> as a real-valued Gaussian with mean zero 
and standard deviation a. In this case, it is easy to show that 

cp^"^ ★ (^(") (t _ s) = 1 ^ ((t _ s)/a) (D.31) 

where (p^^^ is a mean zero Gaussian with unit variance. In fact this is a general 
property often used in distribution theory where if Jj^dt(p^^\t) — 1 then 

]im -ip^'\t / a) ^S{t). (D.32) 

<T->-0 (J 

In this replacement Eq. (D.31) is also satisfied. Using this relation in the right-hand 
side of Eq. (D.30), 

lim dt dsCij{X,t — s) — Sij lim dt ds - ip'^^^ -k ip^^'' {{t — s)/a). 
^-^^Jo Jo '^^^Jo Jo 

(D.33) 

This limit is easily evaluated by making the change of variables t = {t — s)/a, 

5ij lim dt dt^^^^i<^^^^ {i)^5ijT (f). (D.34) 

^-^0 Jo Jo Jo 

If is a normalized real-valued distribution then 'p'^^^ -k ip^^'> (t) = (p^^^ -k (p^^^ (— t) 

and so 



poo 1 roo 

/ di>(i) * (^(1) it} = U dt^^^^ * ^^^^ (t) 
Jo 2 J_^ 



oo 

oo 



_ 1 

~ 2' 

The change of variables in Eq. (D.34) is the delta correlation limit but because 
of the timc-ordcrcd integration, wc obtain the factor of ^. However in the general 
case, (p^^'> need not be real-valued. For our example involving quasi-monochromatic 
fields, it is sufficient for (/? to be a real-valued function as it is simply a mathematical 
tool representing a limit on the rate of change of the arbitrary complex functions 
f e C^R+^^C^. 
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In the general Wong-Zakai limit Cij{\, t) is not assumed to be real, only that the 
criteria of Eq. (D.5) are satisfied. From Eq. (D.5c) we have that 



oo 



lim J dt Cij{X,t) = 6ij, (D.36) 
meaning that we can define the potentially complex constants 

poo 

Kij = lim / dtCji{X,t) 

POO pO 

'^ij = / dtc*i{X,t) = lim / dtCji{\t) 
where we used the fact that c*j(A, t) — Cji{X, —t). Combining these two results means 



(D.37) 



^ij ^ K'ij + (D.38) 

In the case of a real- valued smoother Kij — ^Sij, but the general case allows for a 
complex coefficient. In complex analysis there is a general relation that 

POO 

/ dt e"*'^* = TT d{uj) - i V.V. [I] (D.39) 

where V.V. denotes taking the Cauchy principal value. Expressing 9?*^^^ -k 99*^^^ it) in 
the frequency domain means that 

|2 



K'ij = Sij / dt duj \ip^^\uj)\ 



(POO POO 
TT / duj\^^^\u)\ 6{u)-iV.V. du 
J —00 J —00 



|^W(a;)f 



(D.40) 



2 

The requirement of Eq. (D.38) implies that tt |y9^^^(0)| = |. The principal value 
will be zero for a symmetric power distribution (i.e. for real <^^^^) but it is nonzero 
in general. The remaining complex coefficient is what Gough refers to as a gauge 
freedom as it depends upon the nature of ip^^\ Here we will assume that cp^^^ is 
real- valued and so k^j = k,*j = 
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D.3 The Limiting Propagator 



We just considered one part of the second order term in the time-ordered exponential 
and how through normal ordering it develops a nonzero correction. As this is just 
one part of the total time-ordered exponential, more complicated expressions are 
generated, involving iterated integrals of the form 



Depending upon the relative order of the times, these integrals may or may not 
converge to zero as A — )■ 0. It turns out that the only terms that are nonzero have 
time consecutive integrals, meaning that for each contraction we must have Cjj(A, t„) 
be evaluated at t„ = t„ — tn-i [92]. In the above example, t„ = i„ — t„_3 and 
^5 = ^5 — ^1 are not time consecutive intervals and so Eq. (D.41) converges to zero. 
Upon identifying the class of nonzero integrals, it is possible to then re-sum the 
expansion, which in general generates a Neumann series [43]. 

Once the A — > limit is taken, the normally ordered propagator can be identified 
and its equivalent Ito form QSDE written down. A general QSDE for the unitary 

propagator Ut can be written as 



The coefficients Ga/s are constrained to insure unitarity. Appendix C.l discusses 
these constraints at length and shows how they can be re-expressed in terms of the 
operators -S", L and H. Eq. (C.26) gives this conversion. We note here that H is the 
part of the unitary that is completely uncoupled to the field operators and so we will 

have the correspondence H = Eqq. 

In terms of the system operators Ea/s {ck, (3 — 0,1, ... ,d) defining Hint{^, t) in 
Eq. (2.145), the general limiting coefficients are 




(D.41) 




(D.42) 




(D.43) 
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where i and j start from 1 and we introduced the following notation. In the most 
general case, we have a d x d matrix of constants K = well as a matrix of 

operators E = E^. A Neumann series is the operator- valued generalization of a 
geometric series, so that for an operator T, 

oo 

^r" = (i-r)-i (D.44) 

n=0 

is well defined whenever 1 — T is invertible. The time consecutive contractions ul- 
timately generate a Neumann series where T is the matrix of operators — iKE = 
—iKijEjf;. The limiting coefficient then involves the i,j component of the opera- 
tor/matrix inverse 1/(1 -|- iKM). 
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